A method for the amplification of nucleotide sequences.

A method for the amplification of a nucleic acid fragment containing unknown sequence and kits therefor are
provided which enable long nucleotide sequences to be rapidly and efficiently sequenced. The method
comprises the formation of target nucleic acid fragment/vectorette units by cleavage of target nucleic acid
followed by ligation. One of the nucleic acid fragments will contain an initiating priming region of known
sequence for hybridisation with an initiating primer and target nucleic acid fragment/vectorette units will contain a
vectorette priming region of known sequence for hybridisation with a vectorette primer. Amplification is effected
by primer extension of an initiating primer hybridised to the initiating priming region of the target nucleic acid
fragment/vectorette unit.
A METHOD FOR THE AMPLIFICATION OF NUCLEOTIDE SEQUENCES



The present invention relates to a method for the amplification of nucleotide sequences and kits
therefor. Such a method is of particular interest in relation to the amplification of sequences only a portion
of which is known and enables long nucleotide sequences to be rapidly and efficiently sequenced. The
method avoids the recombinant DNA cloning procedures hitherto necessary for the sequencing of unknown
nucleotide sequences. By so doing it also allows polymorphisms between nucleotide sequences of different
alleles at a genetic locus to be detected as well as simultaneous analysis of alleles at a particular locus in
different individuals.

Nucleotides may exist as individual nucleotides or base pairs (hereinafter also referred to as bp) or in
strands of nucleotides, each strand containing up to 10^[exponent illegible] bp or even more. The human
genome, for example, is believed to contain approximately 3 x 10^9 bp, a single chromosome containing
approximately 10^[illegible]-10^[illegible] bp.
Techniques for sequencing relatively short nucleotide sequences have existed for many years and include
the method of Maxam and Gilbert [Maxam A.M., Gilbert W. (1977) "A new method for sequencing DNA",
Proc. Natl. Acad. Sci. USA 74, 560-564 and "Sequencing end-labelled DNA with base-specific chemical
cleavages", Methods in Enzymology 65, 499-560 (1980)] in which single-stranded DNA radiolabelled at one
end, for example with 32P, is subjected to several chemical cleavage protocols (for example with
dimethylsulphate or hydrazine) that selectively make breaks on one side of a particular nucleoside. The
fragments obtained are separated according to size by electrophoresis on acrylamide gels and identified by
autoradiography. Relatively short nucleotide sequences may also be determined by the enzymatic
"dideoxy" technique of Sanger et al [Proc. Natl. Acad. Sci. USA 74, 5463-7 (1977)] in which the Klenow
fragment of DNA polymerase I or T7 DNA polymerase or Taq DNA polymerase is used to synthesize a
complementary copy of the single-stranded target sequence in the presence of the four deoxynucleoside
triphosphates, one or more of which is radiolabelled, for example with 32P, and in four separate incubation
mixes containing a low concentration of one each of the four dideoxynucleoside triphosphates. A population
of truncated radioactive DNA molecules, each having a common 5'-end, but each varying in length to a
base-specific 3'-end, is thus obtained in each reaction. Following appropriate incubation the DNA in each
mixture may be denatured, electrophoresed side by side, and the radioactive bands of single-stranded DNA
detected by autoradiography. The sequence of the target DNA may then be read off directly from the
autoradiograph. Furthermore E.I. Du Pont de Nemours & Company (DuPont) have recently placed on the
market an automated instrument for sequencing DNA strands which embodies a modification of the
above-mentioned "dideoxy" technique. This modification involves the use of the four dideoxynucleoside
triphosphate terminators tagged with fluorescent dyes, each dideoxy chain terminator emitting light of a
slightly different wavelength when excited by an argon laser [Science, 238, 336 (1987)]. All four
dideoxynucleoside triphosphate terminators may be used in the same pot since the terminators can be
distinguished by their emission spectra. The resulting mixture of DNA fragments may be electrophoresed in
a single lane of a polyacrylamide gel. The sequence of the target DNA may then be read automatically
since the identity of the nucleotide terminating each band on the gel is determined by its characteristic
fluorescent emission.

It has been estimated that the DuPont sequencer is capable of identifying about 10,000 nucleotides a
day under optimum conditions in contrast to an average of about 50,000 nucleotides a year which is
estimated for a skilled worker using the unmodified manual "dideoxy" technique. Whilst this is a substantial
improvement in the speed and efficiency of determining relatively short nucleotide sequences, the rate
determining step in the determination of relatively long sequences, for example genomic sequences, is
dependent on the step colloquially known as "chromosome walking". "Chromosome walking" is a technique
which comprises the sequential isolation of clones carrying overlapping fragments to span a segment of for
example a chromosome that is larger than can be carried in a phage or a cosmid or a yeast artificial
chromosome (YAC) vector. This technique thus allows isolation of a locus of interest for which no probe is
available and is of particular use where the locus of interest is known to be linked to a locus which has been
identified and cloned such as a gene or DNA marker. The locus which has been identified may then be
employed as a probe to screen a genomic library and will hybridise with any fragments containing
complementary nucleotide sequences and which therefore represent overlapping clones. Such overlapping
sequences may be in either the 5' or the 3'-sense. A fragment identified as representing an overlapping
clone may then itself be used as a probe to rescreen the genomic library and hybridise with any fragments
containing complementary nucleotide sequences which thus represent further overlapping clones. By
repetition of this process the nucleotide sequences of regions further and further away from the originally
identified locus may be determined until eventually the locus of interest is encountered.
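
The iterative screening described above can be illustrated with a minimal sketch. It assumes, purely for
illustration, that each clone in a genomic library is represented by hypothetical (left, right) coordinates
in bp and that a clone "hybridises" to the probe when its interval contains the current probe position. The
function name and data are not part of the specification, and only a walk in one direction is modelled.

```python
def chromosome_walk(clones, start, target):
    """Walk from position `start` towards `target` through overlapping clones.

    clones: list of (left, right) intervals in hypothetical bp coordinates.
    Returns the ordered list of clones used, or None if the walk stalls.
    """
    path = []
    position = start
    while position < target:
        # "Screen the library": keep the clones that contain the probe position.
        hits = [c for c in clones if c[0] <= position <= c[1]]
        if not hits:
            return None
        # Use the hit reaching furthest towards the target as the next probe.
        best = max(hits, key=lambda c: c[1])
        if best[1] <= position:
            return None  # no clone extends the walk: a premature end
        path.append(best)
        position = best[1]
    return path


library = [(0, 40), (30, 75), (35, 60), (70, 120)]
print(chromosome_walk(library, start=10, target=100))
print(chromosome_walk([(0, 40), (50, 90)], start=10, target=80))
```

Each pass of the loop corresponds to one round of screening, so the number of rounds grows with the
distance to be covered, and a single uncloned gap ends the walk.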
The technique of "chromosome walking" involves a number of potential difficulties as is exemplified by
the time taken from discovery of a marker for a genetic disorder to discovery of the specific genetic lesion
responsible for the disorder. Thus, for example, a linked genetic marker for Huntington's Chorea (D4S10) was
discovered in 1983, but still today the specific genetic lesion responsible for this disorder is not known.
Similar comments apply to many other genetic disorders.

The technique of "chromosome walking" particularly suffers from the disadvantage that cloning of
genomic DNA is a prerequisite. In a number of circumstances cloning may prove impossible or at least very
difficult and in such situations the "chromosome walk" comes to a premature end: A R Wyman and K F
Wertman, in Methods in Enzymology, Vol 152, Guide to Molecular Cloning Techniques, S L Berger and A R
Kimmel, editors, Academic Press, San Diego, 1987, pages 173-180. Moreover the analysis of the fragments
identified as representing overlapping clones is complex in view of inter alia the number of such fragments
which may be located in any one screening of the genomic library and the fact that the overlapping
sequences may be in either the 5' or the 3' sense.

A further serious practical problem with "chromosome walking" is the widespread occurrence of
repetitive sequences within the human and other genomes. Thus if the identified locus contains such
repetitive elements all such elements within the human genome may be identified incorrectly as overlapping
clones. Analysis then becomes extremely complex.

Furthermore any overlapping clones obtained by "chromosome walking" will, when sequenced as
hereinbefore described, generate sequence information derived from a single cloned allele from the
identified locus. In the context of studying the important genetic differences between individual members of
a species and their relation to an individual's phenotype it would be advantageous if more than a single
allele could be analysed and characterised simultaneously.

The present invention is based upon the discovery of a method which obviates, at least in part, the
above-mentioned difficulties by amplifying fragments obtained via cleavage of target DNA at specific sites
outside known sequences, thus overcoming the need for cloning.

Thus according to one feature of the present invention there is provided a method for the amplification
of a nucleic acid fragment comprising unknown sequence, by primer extension, which method comprises
cleaving a target nucleic acid to obtain target nucleic acid fragments, one of said fragments containing an
initiating priming region of known nucleotide sequence for hybridisation with an initiating primer, preparing
target nucleic acid fragment/vectorette units from the target nucleic acid fragments by ligation, each unit
having a vectorette priming region of known sequence for hybridisation with a vectorette primer, and
treating the target nucleic acid fragment/vectorette units, together or sequentially, with appropriate
nucleoside triphosphates and an agent for polymerisation of the nucleoside triphosphates under hybridising
conditions, such that an extension product of an initiating primer is synthesised complementary to a single
stranded target nucleic acid/vectorette unit having an initiating priming region to which is hybridised an
initiating primer selected so as to be substantially complementary to the initiating priming region, whereas
no such extension product is synthesised complementary to single stranded target nucleic acid
fragment/vectorette units having no such initiating priming region.
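
The selectivity of the method described above can be sketched in a few lines. The sketch is a deliberate
simplification: each unit is modelled as a single top-strand string with a vectorette at its downstream end
only, the primer is written in the same sense as that strand, and the vectorette and fragment sequences
are hypothetical examples rather than sequences taken from the specification.

```python
VECTORETTE = "CCTAGGTTAACC"  # hypothetical vectorette sequence, for illustration only


def make_units(fragments, vectorette=VECTORETTE):
    """Model ligation: append the vectorette to the downstream end of each fragment."""
    return [fragment + vectorette for fragment in fragments]


def extend_initiating_primer(units, primer):
    """Return one extension product per unit that contains the initiating priming region.

    Units lacking the priming region give no product, which is the basis of
    the selectivity described in the text.
    """
    products = []
    for unit in units:
        position = unit.find(primer)
        if position != -1:
            # Extension runs from the primer to the end of the unit, so the
            # product carries the vectorette sequence at its far end.
            products.append(unit[position:])
    return products


fragments = ["AATTCGGATACCA", "AATTCTTGACGTAGCTAGG", "AATTCCCGTA"]
units = make_units(fragments)
print(extend_initiating_primer(units, primer="GACGTAG"))
```

Only the second unit yields a product, and that product ends in the vectorette sequence, which is what
later makes priming from the vectorette end possible.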

If desired the said extension product may be subjected to amplification in the presence of a vectorette
primer which is selected so as to be substantially complementary to the vectorette priming region.

The target nucleic acid fragment/vectorette units are thus treated with initiating primer and, if the
initiating primer extension product is to be amplified, for example as described by R.K. Saiki et al, Science
239, 487-491 (1987), additionally treated with vectorette primer. Where no vectorette primer is used,
arithmetical or linear amplification (hereinafter referred to as linear amplification) may be achieved by
hybridisation of the initiating primer to the initiating priming region followed by primer extension in the
presence of appropriate nucleoside triphosphates and an agent for polymerisation of the nucleoside
triphosphates, under hybridising conditions, and denaturation. This process of priming, primer extension and
denaturation may be repeated as many times as appropriate to achieve the desired level of amplification.
Preferably, however, amplification is effected in the presence of both initiating and vectorette primers by the
use of the PCR technique referred to above and as hereinafter defined.
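
The difference between linear amplification and PCR can be made concrete with an idealised calculation.
The sketch below assumes that every template is copied in every cycle (an efficiency of 1.0) and ignores
reagent depletion; the function names and the efficiency parameter are illustrative assumptions, not terms
used in the specification.

```python
def linear_products(cycles, templates=1):
    """Initiating primer alone: each cycle copies only the original templates."""
    return templates * cycles


def pcr_products(cycles, templates=1, efficiency=1.0):
    """Both primers: products of earlier cycles serve as templates in later ones."""
    return templates * (1 + efficiency) ** cycles


for n in (10, 20, 30):
    print(n, linear_products(n), pcr_products(n))
```

Under these assumptions thirty cycles give thirty extension products per template with the initiating
primer alone, against roughly 10^9 template equivalents when a vectorette primer is also present.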

According to a further feature of the present invention there is provided a kit for the amplification of a
nucleic acid fragment of unknown sequence by primer extension, which kit comprises:-

1) means for cleaving a target nucleic acid at a specific site to obtain a target nucleic acid fragment;

2) a vectorette adapted for ligation to a target nucleic acid fragment obtained by use of the means for
cleaving a target nucleic acid defined in (1) whereby to form in use a target nucleic acid fragment/vectorette
unit, said vectorette having a vectorette priming region of known sequence for hybridisation with a vectorette
primer;

3) each of four different nucleoside triphosphates; and

4) an agent for polymerisation of the nucleoside triphosphates in (3).

A vectorette priming region may be present or absent from the vectorette portion of a target nucleic
acid fragment/vectorette unit as hereinafter defined. Thus the vectorette (2) in the kit of the present
invention may itself contain no vectorette priming region provided that in use a target nucleic acid
fragment/vectorette unit is formed in which the vectorette portion thereof contains a vectorette priming
region. Thus such units may for example either have a vectorette priming region in the vectorette portion of
the target nucleic acid fragment/vectorette unit as formed by ligation or have a vectorette priming region
which only arises as a result of primer extension of an initiating primer as described hereinafter. References
throughout the specification to "target nucleic acid fragment/vectorette unit" (also termed "vectorette unit")
as hereinafter defined are to be so understood.

Preferably the kit will additionally comprise a vectorette primer having a nucleotide sequence
substantially complementary to the vectorette priming region of the target nucleic acid fragment/vectorette
unit. The presence of such a vectorette primer in the kit of the present invention will enable amplification of
the target nucleic acid fragment/vectorette unit to be effected by PCR techniques (as hereinafter defined) if
this is desired.

The kit may more preferably also contain a series of "nested" vectorette primers (as hereinafter defined)
which may be used, for example, for secondary amplification reactions where necessary or desired and/or
for direct sequencing of the products of amplification as has been described in Proc. Natl. Acad. Sci. USA
85, 7652-7656 (1988) by U B Gyllensten and H Erlich. It will be appreciated that all such vectorette
primers may well be useful as direct sequencing primers for the distal ends of fragments obtained by linear
amplification using an initiating primer alone.

Since the target nucleic acid to be investigated will normally be peculiar to the user of the kit, initiating
primer(s) will not normally be present in the kit, but may be prepared by the user of the kit.

If desired however the kit of the present invention may additionally contain initiating primer(s) and also,
if desired, nested initiating primer(s).

The kit of the present invention may also advantageously contain buffers for performing the method of
the invention, a particular feature of the kit being for example the presence of buffers for varying the
potassium, magnesium and nucleoside triphosphate(s) concentrations of the reaction mixture. These latter
buffers may be desirable for determining the optimum conditions for subsequent cycles of the method of
the invention.

Advantageously the kit will comprise more than one (a plurality of) means for cleaving a target nucleic
acid each at specific sites. Where a plurality of such means are present the kit will normally comprise a
different vectorette for each such means present in order to permit formation of target nucleic acid
fragment/vectorette units in respect of each set of target nucleic acid fragments which may be obtained.
Although the invention is not limited to this application, the plurality of different vectorettes will
advantageously share sequences complementary to a vectorette primer. In such cases a single vectorette
primer will provide amplification from a plurality of target nucleic acid cleavage sites.

Thus in a preferred embodiment the kit of the present invention additionally contains one or more
selected from the following:- initiating primer(s), nested vectorette primer(s), nested initiating primer(s),
sequencing primers, buffers for performing the method of the present invention, and buffers for varying the
magnesium, potassium and nucleoside triphosphate concentrations.

According to a further feature of the present invention there is provided a vectorette library kit for the
amplification of a nucleic acid fragment of unknown sequence by primer extension, which kit comprises:-

1) at least one vectorette library, each library comprising a set of target nucleic acid
fragment/vectorette units obtained from nucleotide sequences of an individual member of a species of
animal, plant or microorganism; and

2) an initiating primer or primers for hybridisation to the initiating priming region of the target nucleic
acid fragment/vectorette units.

The vectorette library kit will preferably comprise a plurality of vectorette libraries. 

The target nucleic acid fragment/vectorette units may be prepared for example either directly from the
desired species or indirectly from such a species after initial cloning in plasmid, phage, cosmid or yeast
artificial chromosome (YAC) vectors. The species of animal, plant or microorganism employed is preferably
human, but may be any other animal species, plant or microorganism such as bacteria, viruses, yeast or
parasites. The nucleotide sequences are preferably from genomic DNA, but may be from sorted
chromosomes or clones.

The vectorette units employed may be either derived from a single cleavage method and/or multiple
vectorette units derived from multiple cleavage methods, which together or separately may constitute a
vectorette library or plurality of vectorette libraries (as hereinafter defined).
The target nucleic acid fragments employed may be derived from a number of different sources. Thus
for example the target nucleic acid fragments may be obtained from single individuals of a species of
animal, plant or microorganism, for example from individuals typical of their species. The target nucleic acid
fragments may also be obtained from single individuals known to be heterozygous for a given genetic locus,
for example a locus causing cystic fibrosis or other inherited disease. The target nucleic acid fragments
may also be obtained from single individuals known to be homozygous for a given genetic locus, for
example cystic fibrosis or other inherited disease. The target nucleic acid fragments may also be obtained
from single individuals known to be normal homozygotes for a given genetic locus, for example cystic
fibrosis or other inherited disease. The target nucleic acid fragments may also be obtained from groups of
individuals (as opposed to single individuals) with a shared phenotype(s). The nucleic acid or tissue from
each member of a group which shares a phenotype may if desired be pooled. Each group of individuals will
consist of at least 2 and advantageously less than 1000, for example 50-500. Vectorette units may be
prepared from the target nucleic acid fragments and the vectorette units pooled or used separately to form
vectorette libraries. The shared phenotype may if desired be a disease or disease predisposition, obligate
carriage of an inherited disease or a normal state with no evidence of the disease or disease predisposition.

Comparison of nucleotide sequences obtained using the method of the invention will identify any
common genetic variants in the population which are "associated" with for example a given disease or
disease predisposition. It will be appreciated that this extends considerably the scope for detailed analysis
over and above that previously attempted using RFLP technology.

The vectorette library kit of the present invention preferably additionally contains vectorette primer(s)
and advantageously one or more selected from nested initiating primer(s), nested vectorette primer(s) and
sequencing primers. The vectorette library kit may also conveniently contain each of four different
nucleoside triphosphates and an agent for polymerisation of the nucleoside triphosphates. If desired the
vectorette library kit may further contain buffers for performing the invention and/or buffers for varying the
magnesium, potassium and nucleoside triphosphate concentrations. These latter buffers may be desirable
for determining the optimum conditions for subsequent cycles of the method of the invention.

The present invention is of wide applicability. Thus for example the present invention may be employed
to identify nucleotide sequences characteristic of a particular microorganism such as a microorganism
responsible for disease in plants or animals, particularly humans, such microorganism being for example a
fungus, yeast, bacterium or virus as well as a parasite (such as Plasmodium (malaria) or trypanosome
(sleeping sickness)). The method of the present invention may also be employed to identify nucleotide
sequences responsible for drug, for example antibiotic, resistance in a given organism. Thus the present
invention enables the production of probes diagnostic of disease in animals or plants and additionally
enables the production of probes diagnostic of drug, for example antibiotic, resistance.

The present invention may also be employed for example 1) to detect nucleotide sequence variations,
for example point mutations, responsible for genetic disorders in plants, but especially in animals,
particularly humans; 2) to detect nucleotide sequence variations, for example deletions, responsible for
neoplastic disease in animals, particularly humans; 3) to detect nucleotide sequence variations responsible
for predispositions to disease or disorders in plants but especially in animals, particularly humans; and 4) to
detect nucleotide sequence variations responsible for a specific characteristic such as a desirable
characteristic, for example in plants such as flower colour, crop yield or herbicide resistance.

The present invention may also be employed in animal studies, for example, in the monitoring of
transgenic animals.

Furthermore, the present invention is of particular interest in the sequencing of an animal genome,
particularly the human genome, for example to identify hitherto unknown polypeptides capable of production
in vivo, particularly in the human, and which polypeptides may for example have therapeutic interest. The
present invention may also be of interest in relation to the production of pharmaceutical proteins, for
example for sequencing regulatory regions of genes or expression systems.

In the context of sequencing large genomes, for example the human genome, it is widely accepted that
the major currently perceived limitation is the preparation of physical maps comprising overlapping
"contigs" of isolated cosmid clones. The current strategies for such mapping projects have been reviewed
in "Mapping our Genes, Genome Projects: How Big, How Fast?", Congress of the United States, Office of
Technology Assessment, The Johns Hopkins University Press, Baltimore and London (1988). Further detail
is provided in "Mapping the Human Genome: Experimental Approaches for Cloning and Ordering DNA
Fragments", R M Myers, Department of Physiology, University of California, San Francisco, USA, in
Mapping our Genes Contractor Reports, Vol 2, Order No. PB 88-162 805/AS, available under the auspices
of the National Technical Information Service (NTIS), 5285 Port Royal Road, Springfield,
VA 22161, USA. An approach to mapping the much smaller E. coli genome is taught in C L Smith et al
(Science, 236, 1448-1453, 1987) and Y Kohara et al (Cell, 50, 495-508, 1987).

The present invention may also be employed for example to sequence large nucleotide fragments
purified by cloning in cosmids and YACs (yeast artificial chromosomes). Cosmids can be used to clone
nucleotide fragments of for example up to about 45 kb and YACs may be used to clone fragments of an
average size of over 300 kb (Nucleic Acids Research 17, 3425-3433 (1989) R Anand et al). The applicability
of the present invention to the sequencing of such large nucleotide fragments enables the rapid sequencing
of very large nucleotide sequences.

A particularly advantageous aspect of the present invention is that sequence analysis performed using
the method of the invention does not involve cloning of the target nucleic acid. In practice this means that
all alleles present in the target nucleic acid sample may be analysed at the same time. The advantages of
this approach for the identification of heterozygotes as well as homozygous normals and mutants have been
described previously where the direct sequencing of PCR products has been used to analyse mutations (for
example point mutations) causing inherited diseases (Nucleic Acids Res., 16, 8233-8243 (1988) by C R
Newton et al). A point mutation is easily visualised in the heterozygote because comigrating bands
corresponding to the 2 different bases appear in two lanes of a "dideoxy" sequencing gel. The homozygotes
(normal or mutant) at this base pair show a single band corresponding to one base or the other at this
position on a sequencing gel.
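
The reading of heterozygotes from a sequencing gel can be sketched as follows. The sketch assumes a
hypothetical representation in which each gel position is given as the set of lanes (A, C, G or T) showing
a band at that position; the function name and the example gel are illustrative only.

```python
def call_genotypes(bands_by_position):
    """Classify each gel position from the set of lanes showing a band there."""
    calls = []
    for position, lanes in enumerate(bands_by_position, start=1):
        bases = "/".join(sorted(lanes))
        if len(lanes) == 1:
            # A single band: both alleles carry the same base.
            calls.append((position, "homozygous", bases))
        elif len(lanes) == 2:
            # Comigrating bands in two lanes: the alleles differ at this base.
            calls.append((position, "heterozygous", bases))
        else:
            calls.append((position, "ambiguous", bases))
    return calls


gel = [{"A"}, {"C"}, {"G", "T"}, {"A"}]
for call in call_genotypes(gel):
    print(call)
```

In this example the third position is reported as heterozygous G/T, corresponding to bands of equal
mobility in the G and T lanes.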

Previously, a prerequisite for this type of PCR-based sequence analysis has been that the sequence of
the region of interest has been determined in advance by methods known per se. This allows PCR primers
to be designed for amplification and subsequent repetitive sequencing of the region of interest. The present
invention allows such analysis to be undertaken with target nucleic acid fragments whose sequence is not
already fully established. Sequencing as performed using the method of the invention on for example
human genomic DNA from an individual reveals the full extent of polymorphic differences between the
diploid chromosomes. If the individual is known to be an obligate heterozygote for an unknown mutation at
a given genetic locus then sequencing at both alleles (preferably simultaneous sequencing) by the method
of the invention will inevitably reveal all the polymorphic bases present which could therefore be causes of
an observed phenotype. The importance of individual polymorphic base changes may be confirmed or
disproved by subsequent study of the same target nucleic acid sequences in known homozygous normals
and mutants (as well as in other obligate heterozygotes). It will furthermore be appreciated that the present
invention represents a considerable advance over current RFLP (restriction fragment length polymorphism)
based approaches to the study of variations between chromosomes. In that case, only those base changes
which infrequently create or destroy a restriction endonuclease recognition site are detected. Techniques
are available for the facile analysis of point mutations which do not result in RFLP (Nucleic Acids Res., 17,
No 7, pages 2503-2516 (1989) by C R Newton et al).

A further feature of the present invention is its utility in the analysis of polygenic or multifactorial
diseases. Thus, for example, it would be advantageous to establish whether particular polymorphic variants
at certain genetic loci are indicative of a predisposition to develop such multifactorial polygenic diseases
as atherosclerosis, hypertension, diabetes, schizophrenia and the like. Attempts to approach this problem
previously using RFLP techniques have proved generally unsuccessful. This is probably not surprising as
such studies have relied on the examination of particular polymorphisms displayed on large numbers of
labour-intensive Southern blots by individual candidate gene probes. Thus the range of the total
polymorphism in the human genome actually analysed for linkage to disease predisposition has been in fact
vanishingly small (see "Molecular Approaches to Human Polygenic Disease", Ciba Foundation Symposium
130, John Wiley, Chichester 1987). Using the present invention it becomes possible to mix target nucleic
acids from a number of individuals with a given disease, disease predisposition or other phenotype.
Vectorette libraries as hereinafter defined may be constructed from such pooled target nucleic acids, for
example pooled human genomic DNA. The present invention allows this pooled genomic DNA to be
sequenced in a processive manner. Common polymorphisms will become apparent in a similar fashion to
the observations with a single heterozygote. Again the lack of any need for recombinant DNA cloning
means that all alleles present in the pooled genomic material may be analysed simultaneously. The number
of individual samples present in the pool is not limited. Use of the method of the present invention simply
reveals a consensus genotype plus polymorphism. A comparison of results thus obtained with similar
results obtained by analysis of pooled genomic DNA from individuals who do not manifest the given
disease, disease predisposition or other phenotype will allow polymorphisms associated with the disease or
phenotype (or conversely with a lack of the disease or phenotype) to be identified. The analysis is not
limited to the small area of the human genome close to candidate genes. This type of analysis using the
method of the invention may be further facilitated by combining the amplified products obtained with a
given initiating primer and a particular vectorette library made with target nucleic acid from either a
"normal" or an "affected" pooled population of individuals. If there are indeed gross polymorphic
differences between the populations then mixing the amplified products, denaturing and re-annealing would
generate double-stranded amplification products with mismatched base pairs. Such mismatches may be
revealed using reagents such as OsO4 or hydroxylamine as taught by R G H Cotton, N R Rodrigues and R
D Campbell (PNAS, 4397-4401, 1988).
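
The mixing, denaturing and re-annealing step described above produces heteroduplexes that are mismatched
wherever the two pooled products differ. A minimal sketch of locating such positions is given below. It
assumes that both amplification products are written in the same strand sense and have equal length, and
the sequences shown are invented for illustration.

```python
def mismatch_positions(product_a, product_b):
    """Return the 1-based positions at which a heteroduplex of the two products
    would contain a mismatched base pair.

    Both products are given in the same strand sense, so a heteroduplex formed
    from one strand of each is mismatched exactly where the sequences differ.
    """
    if len(product_a) != len(product_b):
        raise ValueError("this sketch assumes products of equal length")
    return [
        index
        for index, (a, b) in enumerate(zip(product_a, product_b), start=1)
        if a != b
    ]


normal = "ACGTTGCA"
affected = "ACGTCGCA"
print(mismatch_positions(normal, affected))
```

The positions returned correspond to the sites that a mismatch-specific chemical reagent would be expected
to modify in the re-annealed mixture.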

A glossary of certain terms used herein is set out below in order to assist the reader of the present
specification.

The term "target nucleic acid" refers to a nucleotide sequence, generally genomic DNA, for example
plant or animal genomic DNA such as human DNA or bacterial DNA. Such a "target nucleic acid" for use in
the method of the present invention will normally comprise a portion (generally a small portion) of known
sequence and generally a much larger portion of unknown sequence.

The term "target nucleic acid fragment" is used herein to mean a fragment of a target nucleic acid
obtained by cleaving (as hereinafter defined) target nucleic acid. The term "target nucleic acid fragment"
therefore is not limited to such a fragment obtained by the use of a restriction endonuclease. Furthermore
such a fragment may for example comprise a portion of known sequence and generally a larger portion of
unknown sequence, or the fragment may be of unknown sequence. Where the fragment is of unknown
sequence, cleavage and generation of target nucleic acid fragment/vectorette units may be performed as
described hereinafter. In such circumstances random DNA probes may be generated from the much larger
genomic fragment by linear amplification with a "random" initiating primer or amplification with a random
initiating primer plus vectorette primer. Specific amplification will generate fragments at random which can
be purified and used in genetic mapping as in Cell 51, 319-337 (1987) H Donis-Keller et al.

The term "amplification" is used herein to refer to the replication of a nucleotide sequence and/or its
complementary sequence by non-biological means and thus includes amplification by the use of an
initiating primer alone, such primer being hybridised to the initiating priming region of a target nucleic acid
fragment/vectorette unit, primer extension being effected in the presence of appropriate nucleoside
triphosphates and an agent for the polymerisation of the nucleoside triphosphates under hybridising
conditions, such primer extension being followed by denaturation, this process of priming, primer extension
and denaturation being repeated as many times as appropriate to achieve a desired level of amplification.
The term "amplification" also includes replication by polymerase chain reaction (PCR) techniques, as for
example described by R.K. Saiki et al. in Science 239, 487-491 (1987) and in U.S. Patents Nos 4,683,195
and 4,683,202, using an initiating primer and a vectorette primer as hereinafter defined, and the expression
polymerase chain reaction technique or PCR technique is used herein to refer to such techniques as
described in these references.
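
The two modes of amplification covered by this definition differ in how product accumulates with the
number of cycles. The following is a minimal sketch only, assuming idealised cycles of 100% efficiency
(an assumption for illustration, not a figure taken from this specification); the function names are
hypothetical.

```python
def linear_copies(cycles: int, templates: int = 1) -> int:
    """Initiating primer alone: only the original templates are copied,
    so one extension product per template is added in each cycle."""
    return templates * cycles


def pcr_copies(cycles: int, templates: int = 1) -> int:
    """Initiating primer plus vectorette primer: extension products also
    act as templates, so the copy number doubles in each ideal cycle."""
    return templates * 2 ** cycles


if __name__ == "__main__":
    for n in (10, 20, 30):
        print(n, linear_copies(n), pcr_copies(n))
```

Under these assumptions, linear amplification grows in proportion to the cycle number whereas PCR grows
geometrically, which is why the choice between the two affects the number of cycles needed.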

The term "non-biological" is used herein only to exclude amplification by direct cloning and propagation
of bacterial colonies. It will therefore be appreciated that the term amplification is used herein to include
amplification processes such as those described in PCT Patent Publication WO 87/06270 (or Biotechnology
Vol 6, October 1988), PCT Patent Publication WO 88/10315 or PCT Patent Publication WO 89/01050.

The term "cleaving" is used herein to refer to cleavage of a nucleic acid at a specific site. Such
cleavage is conveniently effected using a restriction endonuclease, preferably a 6 bp cutter, having a known
recognition sequence and cleavage pattern and thus the characteristic of cleaving DNA at specific sites.

In this regard the position of these "specific sites" in a given target nucleic acid will not normally be
known relative to the position of another known element in the target nucleic acid, but the sequence of the
specific sites will be known as will their cleavage patterns. Thus the terminal sequences of the restriction
fragments obtained will be known. Thus for example the restriction endonuclease EcoRI recognises the
sequence:

    -G|AATTC-    ->    -G      +  AATTC-
    -CTTAA|G-          -CTTAA         G-

and cleaves such a sequence as indicated to yield restriction fragments having the cohesive ends indicated.
Since the sequence of the cohesive end is known it is possible to produce target nucleic acid
fragment/vectorette units on the basis of this knowledge as described hereinafter in relation to the
expression "target nucleic acid fragment/vectorette unit". Means for cleaving a target nucleic acid at a
specific site other than restriction endonucleases may be employed, but the use of a restriction
endonuclease is preferred. Any convenient restriction endonuclease may be used. Examples of individual
restriction endonucleases include those detailed in Nucleic Acids Research, Sequences Supplement, Volume
16, 1988, page r271-r313 and in "Current Protocols in Molecular Biology 1987-1988", Edited by Ausubel
F.M., Brent R., Kingston R.E., Moore D.D., Smith J.A., Seidman J.G. and Struhl K., Wiley Interscience,
Section 3, Table 3.1-1. Restriction endonucleases capable of producing fragments with cohesive ends such
as EcoRI, HindIII and XbaI are convenient regardless of whether the cohesive ends have 5' or 3'
overhangs. It will also be appreciated that as well as by using restriction endonucleases capable of
producing fragments with either a 3' or a 5' overhanging cohesive end the production of target nucleic acid
fragment/vectorette units is also possible using restriction endonucleases producing blunt-ended fragments
in conjunction with appropriate blunt-ended vectorettes as hereinafter defined. Means for cleaving a target
nucleic acid at (a) specific site(s) other than by standard restriction endonuclease digestion are known in the
art and include, for example, the use of adapter-primers and Class-IIS restriction enzymes (S C Kim et al,
Science, 240, 504-506 (1988); W Szybalski, Gene, 40, 169 (1985); A J Podhajska and W Szybalski, Gene,
40, 175 (1985)) as well as various chemical approaches (B L Iverson and P B Dervan, J. Amer. Chem. Soc.,
109, 1241-1243 (1987); G B Dreyer and P B Dervan, Proc. Natl. Acad. Sci. USA, 82, 968 (1985); V V
Vlassov et al, Nucleic Acids Res., 14, 4065 (1986); H E Moser and P B Dervan, Science, 238, 645 (1987);
D R Corey and P G Schultz, Science, 238, 1401 (1987); J P Sluka et al, Science, 238, 1129 (1987)).
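
Cleavage at a known recognition sequence, and the resulting known fragment ends, can be illustrated with
a short sketch. It is a simplified model only: it treats a single strand as a text string, uses the EcoRI
recognition sequence and cut position given above, and ignores the complementary strand. The function name
`digest` and the example sequence are hypothetical.

```python
ECORI_SITE = "GAATTC"
CUT_OFFSET = 1  # EcoRI cuts between G and AATTC on the strand shown


def digest(seq: str, site: str = ECORI_SITE, cut_offset: int = CUT_OFFSET) -> list[str]:
    """Split one strand at every occurrence of the recognition site."""
    seq = seq.upper()
    fragments = []
    start = 0
    pos = seq.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(seq[start:cut])
        start = cut
        pos = seq.find(site, pos + 1)
    fragments.append(seq[start:])
    return fragments


if __name__ == "__main__":
    # Hypothetical target: the flanking sequence is unknown in practice,
    # but every internal fragment begins with the known AATTC end.
    for fragment in digest("TTGAATTCAAGGAATTCTT"):
        print(fragment)
```

Although the positions of the sites are not known in advance, every fragment produced after a cut begins
with the same known terminal sequence, which is what allows a suitably adapted vectorette to be ligated.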

Furthermore, it is possible to render target nucleic acid fragments blunt-ended, either by DNA
polymerase mediated fill-in/repair of any 5' overhanging cohesive ends or by removal of 3' and/or 5'
overhangs using S1 nuclease mediated single strand digestion, irrespective of whether they are generated
by restriction enzyme digestion or other (eg chemical) means. All such blunt-ended fragments may again be
converted into target nucleic acid fragment/vectorette units by attachment of the appropriate blunt-ended
vectorette.

The expression "initiating priming region" as used herein means that portion of a cleaved, for example
restriction enzyme digested, target nucleic acid fragment which is of known nucleotide sequence and to
which in use an initiating primer and, if desired, nested initiating primer(s) (as hereinafter defined), for
example overlapping nested initiating primer(s), may hybridise. In general the method of the present
invention will be effected such that only one of the target nucleic acid fragments present will have an
initiating priming region. An exception would be the use of a mixture of a plurality of different vectorette
libraries as hereinafter defined.

The expression "vectorette priming region" as used herein means that portion of the target nucleic acid
fragment/vectorette unit which is of known nucleotide sequence as defined by the vectorette itself and to
which in use the vectorette primer and, if desired, nested vectorette primer(s) (as hereinafter defined), for
example overlapping nested vectorette primer(s), may hybridise. The vectorette priming region is present in
the strand complementary to the strand containing the initiating priming region. In this regard the
"vectorette priming region" to which in use the vectorette primer and, if desired, nested vectorette primer(s)
will hybridise may be present either in the target nucleic acid fragment/vectorette unit prepared by ligation
or in a target nucleic acid fragment/vectorette unit prepared by primer extension of an initiating primer or in
both. Thus it will be appreciated that the vectorette primer and, if desired, nested vectorette primer(s) to be
used in the method of the present invention may be selected for hybridisation with a vectorette priming
region which may, for example, not be generated until primer extension of an initiating primer and, if desired,
nested initiating primer(s). A consequence of this is that, for example, the vectorette itself need not be a
totally self-complementary double-stranded DNA fragment.

The term "primer" as used herein refers to an oligonucleotide, whether occurring naturally as in a
purified restriction digest or produced synthetically, which is capable of acting as a point of initiation of
synthesis when placed under conditions in which synthesis of a primer extension product which is
complementary to a nucleic acid strand is induced, i.e., in the presence of appropriate nucleoside
triphosphates and an agent for polymerisation such as DNA polymerase in an appropriate buffer ("buffer"
includes pH, ionic strength, cofactors, etc.) and at a suitable temperature.

The primer is preferably single stranded for maximum efficiency in extension, but may alternatively be
double stranded. If double stranded, the primer is first treated to separate its strands before being used to
prepare extension products. Preferably, the primer is an oligodeoxyribonucleotide. The primer must be
sufficiently long to prime the synthesis of extension products in the presence of the agent for polymerization.
The exact lengths of the primers will depend on many factors, including temperature and source of
primer and use of the method. For example, depending on the complexity of the target sequence, the
initiating and vectorette primers will typically contain 15-35 nucleotides, although they may contain more or
fewer nucleotides. Short primer molecules generally require lower temperatures to form sufficiently stable
hybrid complexes with the template.
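
The dependence of hybrid stability on primer length can be illustrated with a rough estimate of melting
temperature. The sketch below uses the widely quoted "Wallace rule" (2 degrees C per A or T, 4 degrees C
per G or C), which is not taken from this specification and is only a crude approximation for short
oligonucleotides; the function name and example primers are hypothetical.

```python
def wallace_tm(primer: str) -> int:
    """Rough melting temperature in degrees C by the Wallace rule.

    This is an approximation swapped in for illustration; it ignores salt
    concentration, primer concentration and nearest-neighbour effects.
    """
    p = primer.upper()
    at = p.count("A") + p.count("T")
    gc = p.count("G") + p.count("C")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    short_primer = "ATGCATGCATGCATG"           # 15 nucleotides
    long_primer = "ATGCATGCATGCATGCATGCATGCA"  # 25 nucleotides
    print(wallace_tm(short_primer), wallace_tm(long_primer))
```

On this estimate a shorter primer of similar base composition always gives a lower value than a longer
one, consistent with the statement that short primers generally require lower temperatures.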

The term "initiating primer" is used herein to refer to a primer capable of hybridisation with the initiating
priming region as hereinbefore defined. In use with, for example, vectorette units prepared from total human
genomic DNA, it may be preferable for the "initiating primer" to be longer than 15-17 nucleotides so as to
avoid hybridisation and priming, at random, to sequences present in the human genome which happen by
chance to match that of the initiating priming region.
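
The preference for initiating primers longer than 15-17 nucleotides can be made concrete with a simple
expected-value estimate. The sketch below assumes a random genome of about 3 x 10^9 bp with equal base
frequencies and counts both strands; these are simplifying assumptions for illustration (real genomes
contain repeats and biased composition), and the function name is hypothetical.

```python
def expected_chance_matches(primer_length: int,
                            genome_bp: float = 3e9,
                            strands: int = 2) -> float:
    """Expected number of exact chance matches to a primer of the given length.

    Assumes each base is independent and equally likely, so a given
    n-mer occurs with probability 1 / 4**n at any position.
    """
    return strands * genome_bp / 4 ** primer_length


if __name__ == "__main__":
    for n in (15, 17, 20, 25):
        print(n, expected_chance_matches(n))
```

Under these assumptions a 15-mer is expected to match by chance several times in the human genome, a
17-mer less than once, and longer primers progressively less often.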

The term "vectorette primer" is used herein to refer to a primer capable of hybridisation to the
vectorette priming region of the target nucleic acid fragment/vectorette unit. The "vectorette primer" will
have a nucleotide sequence such that it is capable of hybridising to an initiating primer extension product
after separation from its complement, whereby the initiating primer extension product serves as a template
for synthesis of an extension product of the vectorette primer, thereby facilitating amplification. Since, in
general, the method of the present invention will be effected such that only one of the target nucleic acid
fragment/vectorette units has an initiating priming region, only that unit will be subjected to amplification.
Those target nucleic acid fragment/vectorette units which do not have an initiating priming region will be
incapable of PCR amplification because, whilst the formation of a vectorette primer extension product may
be possible, no initiating primer will be able to hybridise to the vectorette primer extension product since no
initiating priming region will be present and thus no PCR amplification will be possible.

In use it is preferred that the synthesis of vectorette primer amplification products is dependent upon
the initial synthesis of an extension product of the initiating primer. This avoids formation of large numbers
of unamplifiable vectorette primer extension products which might be expected to deplete the available
nucleoside triphosphates or other co-factors in the reaction mixture in a detrimental fashion.

The term "nested primer" as used herein means a primer displaced by one or more base pairs in the
3' direction away from the 5' terminus of the initiating primer or in the 3' direction away from the 5'
terminus of the vectorette primer or in the 3' direction away from the 5' terminus of both such initiating
primer and vectorette primer. It will be appreciated that the sequence of the nested primer or primers will
necessarily be selected from sequence complementary to the known initiating or vectorette priming regions
or from both such regions.

The term "target nucleic acid fragment/vectorette unit" (also referred to herein as a "vectorette unit") is
used herein to refer to a nucleotide sequence, for example a DNA sequence, comprising a target nucleic
acid fragment and a portion of known nucleotide sequence, for example a DNA sequence, which portion in
single stranded form is capable of hybridisation to a vectorette primer as hereinbefore defined. In this
regard it will be appreciated that the "target nucleic acid fragment/vectorette unit" may be capable of
hybridisation to a vectorette primer either by virtue of the presence in the aforesaid unit of a portion of
known nucleotide sequence which sequence is substantially complementary to the sequence of the
vectorette primer or by virtue of the ability of an initiating primer extension product, based on one strand of
the aforesaid unit as template, to comprise a nucleotide sequence substantially complementary to the
sequence of the vectorette primer. It will be appreciated in this regard that the "target nucleic acid
fragment/vectorette unit" is such that in one strand it will have substantially the same sequence at least in
part as the vectorette primer. This strand will also contain the initiating priming region.

The portion of known nucleotide sequence, for example DNA sequence, may be derived from any
convenient source, provided that it fulfils the above-stated requirement that in single stranded form it is
capable of hybridisation to the vectorette primer. Thus for example the vectorette may be prepared
separately using a DNA synthesiser and the vectorette obtained ligated to the target nucleic acid
fragment(s) to obtain the target nucleic acid fragment/vectorette unit(s). In this regard the vectorette will
conveniently be adapted for ligation to the nucleic acid fragment(s), for example a cohesive end on the target
nucleic acid fragment(s) hybridising with a cohesive end on the vectorette to form a vectorette unit as
hereinbefore defined, or a blunt ended target nucleic acid fragment may be ligated to a blunt ended vectorette
to form a vectorette unit as hereinbefore defined. It is not necessary however that the vectorette be pre-formed
prior to ligation with the target nucleic acid fragment although this may be preferable. Thus for example the
aforementioned unit may be prepared, in an appropriate case, by ligation of a single stranded DNA to the
target nucleic acid fragment, for example by utilising the overhang of the cohesive end to secure a first
single stranded DNA thereto and then to permit a second single stranded DNA to be ligated so as to form
the desired unit.

In one embodiment of the present invention the target nucleic acid fragment/vectorette unit contains a
blocking vectorette portion. The term "blocking vectorette" as used herein refers to a vectorette or to the
vectorette portion of a target nucleic acid fragment/vectorette unit in which one or both free terminal bases
are in modified form to prevent ligation of nucleotides thereto or to prevent primer extension, for example in
the presence of appropriate nucleoside triphosphates and an agent for the polymerisation of the nucleoside
triphosphates under hybridising conditions. Such modifications are known per se and may for example
consist of the presence of a dideoxynucleoside. Thus for example the double stranded blocking vectorette
may have a 3' terminal dideoxynucleoside such as dideoxyadenosine (ddA). Such modification may also
include ribonucleosides in which the diol of the ribose is cleaved, for example with periodate. Alternatively a
3'-deoxynucleoside, for example a 3'-deoxyadenosine residue, may be added at the 3'-terminus using for
example cordycepin triphosphate and terminal transferase. As a further alternative a 3'-amino or 3'-thio
functionality may be introduced chemically at the 3' end by methods known per se.

In a further embodiment of the present invention a vectorette or vectorette portion comprises two
partially hybridised single stranded sequences which possess a degree of non-complementarity such that
vectorette primer extension cannot be effected using such a vectorette.

The term "nucleoside triphosphate" is used herein to refer to the triphosphates of nucleosides present
in either DNA or RNA and thus includes nucleosides which incorporate adenine, cytosine, guanine, thymine
and uracil as base, the sugar moiety being deoxyribose or ribose. In general deoxyribonucleosides will be
employed in combination with a DNA polymerase. It will be appreciated however that other modified bases
capable of base pairing with one of the conventional bases adenine, cytosine, guanine, thymine and uracil
may be employed. Such modified bases include for example 7-deazaguanine and hypoxanthine.

The term "nucleotide" as used herein can refer to nucleotides present in either DNA or RNA and thus
includes nucleotides which incorporate adenine, cytosine, guanine, thymine and uracil as base, the sugar
moiety being deoxyribose or ribose. It will be appreciated however that other modified bases capable of
base pairing with one of the conventional bases, adenine, cytosine, guanine, thymine and uracil, may be
used in the initiating primer and vectorette primer employed in the present invention. Such modified bases
include for example 7-deazaguanine and hypoxanthine.

The agent for polymerization of the nucleoside triphosphates may be any compound or system which
will function to accomplish the synthesis of primer extension products, including enzymes. Suitable
enzymes for this purpose include, for example, E.coli DNA Polymerase I (Richardson C.C. et al, J. Biol.
Chem. 239, 222 (1964)), Klenow fragment of E.coli DNA polymerase I (Jacobsen H. et al, Eur. J. Biochem.
45, 623-627 (1974)), T4 DNA polymerase (Panet A. et al, Biochemistry 12, 5045-5050 (1973)), T7 DNA
polymerase (Tabor S. and Richardson C.C., Proc. Natl. Acad. Sci. USA 84, 4767-4771 (1987)), other
available DNA polymerases, reverse transcriptase, and other enzymes, including thermostable enzymes.
The term "thermostable enzyme" as used herein refers to an enzyme which is relatively stable to heat and
is heat resistant and catalyzes (facilitates) combination of the nucleotides in the proper manner to form the
primer extension products which are complementary to each nucleic acid strand. Generally, the synthesis
will be initiated at the 3' end of each primer and will proceed in the 5' direction along the template strand,
until synthesis terminates, generally producing molecules of different lengths. In the context of the current
invention synthesis will generally terminate at a position determined by the target nucleic acid cleavage site,
resulting in molecules of the same length. There may be enzymes, including thermostable enzymes,
however, which initiate synthesis at the 5' end and proceed in the other direction, using a similar process to
that described above. A preferred thermostable enzyme which may be employed in the process of the
present invention is extracted and purified from Thermus aquaticus and has a molecular weight of about
86,000 - 90,000 daltons as described in European Patent Publication No. 237,362 (see also European
Patent Publication No. 258,017). Thermus aquaticus strain YT1 is available without restriction from the
American Type Culture Collection, 12301 Parklawn Drive, Rockville, Maryland, USA as ATCC 25,104.

It will be obvious that DNA polymerases having superior or more advantageous properties may be
obtained by random mutation of clones expressing these proteins. By way of example, it would obviously
be preferable to obtain a mutated version of double stranded DNA encoding Taq DNA polymerase which
resulted in the expression of a protein with superior properties such as lack of a 5'-exonuclease activity.
Techniques for obtaining such desired mutated versions are well known and are well within the skill of the
average molecular biologist.

The term "complementary to" is used herein in relation to nucleotides to mean a nucleotide which will
base pair with another specific nucleotide when incorporated into DNA or RNA. Thus deoxyadenosine
triphosphate is complementary to thymidine triphosphate and deoxyguanosine triphosphate is complementary
to deoxycytidine triphosphate, whilst deoxyguanosine triphosphate is not complementary to thymidine
triphosphate. It is appreciated in this regard that whilst thymidine triphosphate and deoxyguanosine
triphosphate may base pair under certain circumstances they are not regarded as complementary for the
purposes of this specification.
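
The strict sense of "complementary" adopted here, in which only A pairs with T and G pairs with C, can
be written down as a small lookup. The sketch is limited to the four DNA bases and deliberately excludes
G-T pairing, as the definition above does; the names used are hypothetical.

```python
# Only the pairings regarded as complementary in this specification (DNA bases).
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def is_complementary(base_a: str, base_b: str) -> bool:
    """True only for A-T and G-C pairs; a G-T pair is not accepted."""
    return COMPLEMENT.get(base_a.upper()) == base_b.upper()


def reverse_complement(seq: str) -> str:
    """Sequence of the complementary strand, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


if __name__ == "__main__":
    print(is_complementary("G", "T"))    # G-T pairing is excluded
    print(reverse_complement("GAATTC"))  # the EcoRI site is its own reverse complement
```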

The primers herein are selected to be "substantially" complementary to the different strands of each
specific sequence to be extended or amplified. This means that the primers must be sufficiently
complementary to hybridise with their respective strands. Therefore, the primer sequences need not reflect the
exact sequence of their templates although this would normally be preferable.

The term "vectorette library" is used herein to refer to the plurality of target nucleic acid
fragment/vectorette units which are obtained after cleaving a target nucleic acid at all the potential cleavage
sites in respect of a given restriction endonuclease which the target nucleic acid contains and preparing
target nucleic acid fragment/vectorette units from the total mixture of target nucleic acid fragments by
cycles of ligation to a suitably adapted vectorette portion (and repeat cleavage if required). In general only a
single vectorette unit in a given vectorette library will contain the initiating priming region of interest. In the
case of human genomic DNA cleaved with a restriction endonuclease recognising a specific 6 bp sequence
(a 6 bp cutter) the average size of the resulting target nucleic acid fragments will be 4096 bp and the target
nucleic acid generates approximately 10^6 such fragments. Hence a vectorette library containing
approximately 10^6 target nucleic acid fragment/vectorette units is obtained from human genomic DNA cleaved
with a 6 bp cutter restriction endonuclease and only one such vectorette unit in that total human vectorette
library will contain a given initiating priming region capable of initiating amplification in the presence of an
initiating primer, a vectorette primer if desired, appropriate nucleoside triphosphates and an agent for
polymerisation of the nucleoside triphosphates under hybridising conditions.
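
The figures of 4096 bp and approximately 10^6 fragments follow from simple arithmetic, sketched below.
The calculation assumes a random sequence with equal base frequencies and a human genome of about
3 x 10^9 bp; both are simplifying assumptions for illustration, and the function names are hypothetical.

```python
def average_fragment_bp(site_length: int) -> int:
    """Mean spacing of a specific recognition site in random sequence: 4**n bp."""
    return 4 ** site_length


def expected_fragments(genome_bp: float, site_length: int) -> float:
    """Approximate number of fragments (and hence vectorette units) in a library."""
    return genome_bp / average_fragment_bp(site_length)


if __name__ == "__main__":
    print(average_fragment_bp(6))         # 4096
    print(expected_fragments(3e9, 6))     # about 7.3 x 10^5, i.e. of the order of 10^6
```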

Different vectorette libraries may be prepared from the same target nucleic acid by cleavage with
different restriction endonucleases and ligation of suitably adapted vectorette portions to generate target
nucleic acid fragment/vectorette units. All available restriction endonucleases can be used in this process if
desired and in the limit a vectorette portion can be ligated to target nucleic acid fragment at every
restriction enzyme recognition site in the target nucleic acid. This feature is not always desirable as ideally
the initiating priming region of interest in any given vectorette library will be separated by 100 bp or more
from the attachment point of the vectorette portion. This is because initiating primer extension products or
initiating primer/vectorette primer amplification products smaller than this generate so little sequence
information in the practice of the invention as to be of little value for the efficient sequencing of long
nucleotide sequences. Furthermore the nucleotide sequence of such small products will be contained within
the products obtained using a vectorette library in which the initiating primer is further from the vectorette
portion attachment site. The use of a plurality of different vectorette libraries with a particular initiating
primer allows identification of those libraries wherein the extension or amplification products are of a
convenient size for sequencing. For example it may be particularly convenient to select initiating primer
extension or amplification products of approximately 200 bp, 400 bp, 600 bp, 800 bp, 1000 bp and so on
obtained from particular vectorette libraries with a given initiating primer. Sequencing of such products, from
the vectorette libraries in which they happen to occur, for a given initiating primer, using a vectorette or
nested vectorette sequencing primer and methods known per se is likely to generate overlapping sequence
data for a large region to the 3'-side of the initiating primer. The amount of sequence data generated in one
round of analysis of a plurality of vectorette libraries with a given initiating primer is only limited by the size
of initiating primer extension or amplification products which can be obtained in practice and/or by the
distance (from the initiating primer region) to the most remote restriction endonuclease site represented in
the plurality of vectorette libraries.

In an advantageous embodiment of the method of the present invention, the target nucleic acid
fragment/vectorette units are prepared from target nucleic acid fragments by ligation such that the
vectorette cannot be cleaved from the target nucleic acid fragment/vectorette unit formed by the same
agent used to cleave the target nucleic acid to yield target nucleic acid fragments. It is especially preferred
that the target nucleic acid is cleaved with a restriction endonuclease to yield a target nucleic acid fragment
for ligation to a vectorette, the sequence of the vectorette being selected such that the restriction
endonuclease recognition sequence of the said restriction endonuclease is absent in the target nucleic acid
fragment/vectorette unit, for example in the case of EcoRI described above:

    -GAATTC-    ->    -G               +    AATTX-
    -CTTAAG-          -CTTAA                    Y-
                      Target nucleic        Vectorette
                      acid fragment

where X is any nucleoside other than C and Y is its complementary nucleoside.
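
The effect of the choice of X on the ligated junction can be checked mechanically. The sketch below is a
simplified single-strand model of the EcoRI example: the fragment contributes the terminal G, the
vectorette contributes AATT followed by X. The function names are hypothetical.

```python
ECORI_SITE = "GAATTC"


def ligated_junction(x: str) -> str:
    """Junction sequence formed when the -G end of the fragment is joined
    to an AATTX- vectorette end (one strand only)."""
    return "G" + "AATT" + x.upper()


def site_regenerated(x: str) -> bool:
    """True if ligation recreates the EcoRI recognition sequence."""
    return ligated_junction(x) == ECORI_SITE


if __name__ == "__main__":
    for x in "ACGT":
        print(x, ligated_junction(x), site_regenerated(x))
```

Only X = C recreates GAATTC; with any other nucleoside the junction is no longer a substrate for the
enzyme, so repeat cleavage during ligation removes fragment-to-fragment ligation products without
removing the vectorette.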

The present invention may be effected such that the synthesis of primer extension products or
preferably vectorette primer amplification products is dependent upon the initial synthesis of an extension
product of the initiating primer.

This may, for example, be achieved by linear amplification or preferably by the use of a vectorette
primer which is only capable of hybridising to the extension product of the initiating primer. Thus
advantageously the vectorette portion of the target nucleic acid fragment/vectorette unit comprises a double
stranded portion having first and second strands, the first strand having a terminal polymerisation blocking
moiety and the second strand, which is ligated to that strand of the target nucleic acid fragment containing
the initiating priming region, carrying a single stranded portion, the terminal polymerisation blocking moiety
being effective to prevent extension of the first strand to form a complement to the said single stranded
portion of the second strand in the presence of appropriate nucleoside triphosphates and an agent for the
polymerisation of the nucleoside triphosphates under hybridising conditions. The vectorette primer is thus
incapable of hybridising to the target nucleic acid fragment/vectorette unit, but is capable of hybridising to
the extension product of the initiating primer. Thus no vectorette primer extension products are obtained
until generation of an extension product of the initiating primer, following which the extension products
obtained normally include a portion complementary to the known nucleotide sequence(s) of the single
stranded portion of the second strand. This advantageous embodiment is illustrated hereinafter in Figure 6.

The polymerisation blocking moiety is effective to prevent polymerisation of a nucleotide sequence
from a primer to form the complement of a template nucleotide sequence in the presence of an agent for
polymerisation of nucleoside triphosphates such as a DNA polymerase, for example Klenow fragment of E.
coli DNA polymerase I, T7 DNA polymerase or Taq DNA polymerase. The polymerisation blocking moiety
may be any convenient group known for this purpose such as an appropriate modified nucleoside, for
example a dideoxynucleoside or a 3'-deoxynucleoside such as cordycepin, or a 3'-amino or a 3'-thio
functionality.

In a further especially preferred embodiment of the present invention the vectorette portion of the target
nucleic acid fragment/vectorette unit comprises a double stranded portion having first and second strands,
the second strand of the vectorette portion being ligated to that strand of the target nucleic acid fragment
which contains the initiating priming region, and the nucleotide sequence of the first strand, second strand
and vectorette primer being selected such that the vectorette primer is capable of hybridising to the
complement of the second strand but not to the first strand under the same hybridisation conditions. It will
be appreciated that the presence of a polymerisation blocking moiety in this embodiment is not necessary.
It will further be appreciated, in this case, that the sequence of the vectorette primer will be substantially the
same as the sequence of at least a portion of the second strand of the vectorette. This especially preferred
embodiment of the invention is discussed further hereinafter in relation to Figure 6.

A further preferred embodiment of the present invention comprises the preparation of a plurality of
different vectorette libraries for use with the same single initiating primer, each vectorette library being
prepared by cleaving target nucleic acid at different cleavage sites and preparing target nucleic acid
fragment/vectorette units from the target nucleic acid fragments by ligation whereby to form the said
vectorette library; treating each vectorette library either separately or together with appropriate nucleoside
triphosphates and an agent for polymerisation of the nucleoside triphosphates under hybridising conditions
whereby to obtain a plurality of initiating primer extension products based on use of the same single
initiating primer. The size of such extension products will be determined by the distance from the initiating
primer to the closest 5' site for the particular cleavage means, for example restriction enzyme, used to
construct that particular library.
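
The way in which each library yields an extension product of characteristic size can be sketched as
follows. This is a simplified single-strand model: the size is taken as the distance from the start of the
initiating primer to the start of the next recognition site in the direction of extension, ignoring the
exact cut position within the site and the length added by the vectorette. The target sequence and function
name are hypothetical; the recognition sequences are those of the enzymes named earlier.

```python
SITES = {"EcoRI": "GAATTC", "HindIII": "AAGCTT", "XbaI": "TCTAGA"}


def extension_lengths(target: str, primer: str, sites: dict = SITES) -> dict:
    """Approximate extension product size per library, or None if the
    enzyme has no site beyond the primer in this target."""
    target = target.upper()
    start = target.find(primer.upper())
    if start == -1:
        raise ValueError("initiating primer sequence not found in target")
    lengths = {}
    for enzyme, site in sites.items():
        pos = target.find(site, start + len(primer))
        lengths[enzyme] = None if pos == -1 else pos - start
    return lengths


if __name__ == "__main__":
    # Hypothetical target: primer, then a HindIII site, then an EcoRI site.
    target = "ACGTACGTAC" + "TTTT" + "AAGCTT" + "CCCCC" + "GAATTC" + "GG"
    print(extension_lengths(target, "ACGTACGTAC"))
```

In this toy example the EcoRI library gives the longer product, so it would be the one chosen to reach
furthest into unknown sequence.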

If desired one or more of said initiating primer extension products may be isolated and/or sequenced or
at least a portion of the extension product may be sequenced. Thus for example this embodiment may be
conveniently used to identify a desired, normally the longest, target nucleic acid fragment containing an
initiating priming region, so that the 3' terminal end may be sequenced conveniently with a nested
vectorette primer as hereinbefore described in order to provide a new start point for further use of the
method of the present invention such as this preferred embodiment. The sequence of the 3' terminal end of
the aforementioned longest target nucleic acid fragment may thus become the initiating priming region of a
new target nucleic acid fragment for a further round of vectorette library multiple initiating primer extension
product formation, identification of the longest target nucleic acid fragment and sequencing.

In selecting a new initiating priming region on the basis of novel sequence data generated using the
method of the invention at the 3' terminal end of a target nucleic acid fragment, such sequence data may
routinely be compared with the publicly available database compilations of known nucleic acid sequence
(for example Genbank, EMBL) so as to ensure that a proposed new initiating priming region does not by
chance closely match a known nucleic acid sequence elsewhere in, for example, the genomic DNA of
interest. This is obviously most likely to occur in those cases where the 3'-terminal end of a particular target
nucleic acid fragment happens to comprise repetitive elements such as for example Alu sequences. In such
cases it is advantageous to perform the method of the invention on a plurality of vectorette libraries with a
given initiating primer so as to guarantee that at least one of the resulting extension products has a
non-repetitive unique 3'-terminal end for the selection of a further initiating priming region.

Stepwise progression from one previously unknown initiating priming region to another along a target
nucleic acid, for example human genomic DNA, may conveniently be monitored using samples of the said
target nucleic acid separately cleaved to completion with the same restriction endonucleases as used in the
preparation of target nucleic acid fragment/vectorette units ("vectorette libraries" as hereinbefore defined)
and subjected to agarose gel electrophoresis and Southern Blotting. Probing of the filters so obtained with a
first initiating primer will reveal a pattern of bands consistent with the various restriction enzyme recognition
sites surrounding this first initiating priming region in the target nucleic acid. Use of the method of the
present invention with a plurality of vectorette libraries and this first initiating primer will generate a series of
extension products each of whose 3' terminal ends is defined by the position relative to the initiating
priming region of the closest recognition site for the restriction enzyme used to generate the vectorette
library in question. Thus a map of the restriction sites to the 3' side of a first initiating primer is effectively
obtained. Having subsequently selected a second novel initiating priming region of previously unknown
sequence, linkage to the first initiating priming region is established by reprobing the above Southern Blot
filter with the second novel initiating primer. The pattern of bands obtained will be identical to that obtained
with the first initiating primer in those cases where no recognition site for the restriction enzyme in question
lies between the first and second initiating priming regions. In those cases where a recognition site for the
restriction enzyme in question does occur between the initiating priming regions, as judged by the
appearance of smaller extension products in the corresponding vectorette library, then a fragment of
different size will normally be observed on reprobing the Southern Blot filter with the second initiating
primer. By repetition of this method consistency, accuracy and reliability of stepwise progression from one
initiating priming region to another along a target nucleic acid is maintained and assured.

It will be appreciated that the sequence of the 3'-terminal ends of all the plurality of initiating primer
extension products may be easily obtained using the same vectorette primer or nested vectorette primers
for sequencing by methods known per se. In this way the entire sequence of an unknown segment of target
DNA nucleic acid may be determined in a facile and systematic manner and with much greater convenience
than for example using M13 "Shotgun" cloning. This is because the initiating primer extension
products can be ordered by size and therefore the order of their sequences in the original target nucleic
acid becomes apparent. Each initiating primer extension product shares a 5'-extremity determined by the
initiating primer and a 3'-extremity determined by the closest 3'-site for the particular cleavage means, for
example restriction enzyme, used in the synthesis of that particular vectorette library.
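
The ordering of extension products by size can be illustrated with a minimal sketch. The enzyme names and product sizes below are hypothetical values chosen for illustration only, and the product length is treated as an approximate distance from the initiating primer to the nearest site (it ignores the short primer and vectorette contributions).

```python
# Hypothetical extension-product sizes (bp), one per vectorette library.
# All products share the same 5' end (the initiating primer); each 3' end
# marks the nearest downstream site for that library's restriction enzyme.
product_sizes = {"EcoRI": 2100, "HindIII": 650, "BamHI": 3400, "XbaI": 1200}


def order_sites(sizes):
    """Return (enzyme, distance) pairs sorted by distance from the primer."""
    return sorted(sizes.items(), key=lambda item: item[1])


site_map = order_sites(product_sizes)
for enzyme, distance in site_map:
    print(f"{enzyme:8s} nearest site ~{distance} bp 3' of the initiating primer")
```

Sorting by size gives the order of the nearest restriction sites along the target, which is the restriction map to the 3' side of the initiating primer referred to above.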

In a preferred embodiment according to the present invention any or all of the initiating primer
extension products obtained is (are) sequenced (as hereinafter defined) at least at the end(s) distal to a
given initiating primer so as to determine the sequence of a further initiating primer whereby to obtain
further initiating primer extension products based on primer extension of the further initiating primer.

In a further preferred embodiment according to the present invention an initiating primer extension
product or portion thereof is sequenced (as hereinafter defined) whereby to characterise the said extension
product or portion thereof.

As described above one potentially important application of the present invention is the identification of
a previously unidentified genotype, for example a genetic defect(s) responsible for a phenotype, for
example a genetic disease or disorder, or the identification of a previously unidentified genotype, for
example a genetic defect(s) which is (are) responsible for or a contributory factor in predisposition to a
phenotype, for example a disease.

Thus for example in relation to a genotype such as a genetic disease or disorder the method of the
present invention may be applied to nucleic acid which does not contain the genotype (e.g. genetic
defect(s)) and to nucleic acid which does contain the genotype e.g. genetic defect(s) to be investigated,
identification of the genotype e.g. genetic defect(s) being effected by comparison of the information
generated by sequencing of the two nucleic acid samples. Such comparison might simply be effected, for
example, by comparison of the sequencing gels, conveniently by automatic scanning. In this regard it will be
appreciated that the specific sequences need not be determined per se provided that sufficient data is
generated to enable a difference or differences between the target nucleic acid samples to be detected and
identified, and the terms "sequencing" and "sequenced" are accordingly used herein to include not only
specific nucleotide sequence determination, but also the detection and identification of sequence
differences without specific nucleotide sequence determination. It is convenient to apply the method of the
invention to the target nucleic acid of an obligate heterozygote, for example for the genetic disease or
disorder to be investigated. Of necessity both a normal and a mutant allele for the locus in question will be
present in such an individual and those sites identified using the method of the invention where more than a
single nucleotide is present on sequencing are candidates to be the phenotype, e.g. disease or disorder,
causing mutation.

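The identification of candidate sites in an obligate heterozygote can be sketched as follows. This is only an illustration under stated assumptions: the base calls are invented, and each position is represented as a string of the nucleotides observed at that position on the sequencing gel (a representation chosen here, not taken from the specification).

```python
# Hypothetical per-position base calls read from a sequencing gel of an
# obligate heterozygote; a position showing two bands carries both alleles.
calls = ["A", "C", "GT", "T", "A", "CT", "G"]


def candidate_sites(base_calls):
    """Return 0-based positions where more than one nucleotide was observed."""
    return [i for i, bases in enumerate(base_calls) if len(set(bases)) > 1]


print(candidate_sites(calls))
```

Positions returned by such a comparison are candidates only; they may equally reflect neutral polymorphisms between the two alleles.
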
In addition to the above it is suspected that certain genotypes e.g. genetic defects may predispose
individuals to phenotypes, for example diseases such as premature atherosclerosis, hypertension, diabetes
and cancer. For example, if such genetic defects could be identified then such "high risk" patients could be
monitored and any onset of the disease treated at an early stage. The method of the present invention may
be applied to the identification of such predisposing genotypes. Thus for example the method of the
present invention may be applied to the nucleic acid of a plurality of individuals affected by a phenotype to
be investigated on the one hand and to the nucleic acid of a plurality of individuals presenting no evidence
of the said phenotype on the other hand, identification of a genotype being effected by comparison of the
sequences of the nucleic acid samples. Conveniently nucleic acid from the plurality of individuals affected
by the phenotype to be investigated will be pooled and subjected to the method of the present invention
and similarly nucleic acid from the individuals presenting no such evidence of the said phenotype will be
pooled and subjected to the method of the present invention. Comparison of the sequence differences
between the two pools will identify the presence of any predisposing genotype if any is (are) present. The
advantage of this technique is that it enables individual predisposing genotypes to be identified irrespective
of their frequency of occurrence and irrespective of the overall complexity and number of different
contributory genetic factors to the overall phenotype. Thus if the presence of a combination of apparently
unrelated genetic defects is responsible for or represents a contributory factor in the predisposition to a
disease to be investigated, the method of the present invention will be able to identify this.

One embodiment of the present invention comprises circularising a target nucleic acid fragment having
termini capable of ligation to each other, the target nucleic acid fragment containing a portion of known
nucleotide sequence, such known nucleotide sequence or portion thereof being capable of serving as an
initiating priming region for hybridisation with an initiating primer, hybridising an initiating primer to the said
initiating priming region and subjecting the hybrid thus formed to primer extension in the presence of
appropriate nucleoside triphosphates and an agent for polymerisation of the nucleoside triphosphates under
hybridising conditions. For example 2 initiating primers, oriented relative to each other in the opposite sense
to that normal for PCR, may then be used to effect amplification.

In this regard circularisation of the target nucleic acid fragment may be regarded as the preparation of
the target nucleic acid fragment/vectorette unit by ligation and thus for example primer extension may be
performed based on the circularised target nucleic acid fragment as template. If desired however, the
circularised target nucleic acid fragment may be cleaved and primer extension performed based on the
cleavage product as template as described in Nucleic Acids Research Vol 16 pp 8186 1988 which was
published after our UK Patent Application No. 8818020.3 from which convention priority is claimed. Further,
if desired the cleavage product may be used to prepare a target nucleic acid fragment/vectorette unit by
ligation and initiating primer extension performed based on the target nucleic acid fragment/vectorette unit
thus obtained as template. All such embodiments are within the scope of the present invention.

Thus for example the circularised target nucleic acid fragment formed may be cleaved to yield a
cleavage product which contains at least a portion of known nucleotide sequence, such portion being
capable of serving as an initiating priming region for hybridisation with an initiating primer, in which case for
example the circularised target nucleic acid fragment may be cleaved within the region of known nucleotide
sequence to form a linear molecule having known sequence at its termini flanking unknown sequence; it
may be cleaved at a terminus of the known nucleotide sequence to form a linear molecule having a known
nucleotide sequence at one terminus and a known cleavage site pattern at the other terminus which renders
it capable of ligation to a vectorette; or more preferably it may be cleaved outside the known nucleotide
sequence to form a linear molecule containing the known nucleotide sequence and having a known
cleavage site pattern at at least one terminus, normally both termini, which enables the terminus or termini
to be ligated to form a target nucleic acid fragment/vectorette unit. Thus a preferred embodiment of the
present invention comprises circularising a target nucleic acid fragment having termini capable of ligation to
each other, the target nucleic acid fragment containing a portion of known nucleotide sequence, such known
nucleotide sequence or portion thereof being capable of serving as an
initiating priming region for hybridisation with an initiating primer; cleaving the circularised target nucleic
acid fragment outside the known nucleotide sequence to form a linear molecule containing the known
nucleotide sequence and having a known cleavage site pattern at at least one terminus for ligation to form a
target nucleic acid fragment/vectorette unit; forming said target nucleic acid fragment/vectorette unit by
ligation and treating the target nucleic acid fragment/vectorette unit, together or sequentially, with
appropriate nucleoside triphosphates and an agent for polymerisation of the nucleoside triphosphates under
hybridising conditions.

This preferred embodiment of the invention is of interest in relation to circularised target nucleic acid
fragments of for example at least 1 kb up to about 20 kb but is of particular interest in relation to larger
circularised target nucleic acid fragments, for example of from 20 kb up to about 120 kb, such as circularised
target nucleic acid fragments of about 100 kb. The upper size limit is dependent on practical considerations
and is thus based on the maximum size of target nucleic acid fragments which can be circularised. Such
circularisation may be effected by known techniques [F.S. Collins (1988) Genome Analysis - A practical
approach (Editor K. Davies) p73-94, IRL Publishers, Oxford], for example by effecting ligation at low
concentration. This preferred embodiment may thus be effected by for example digesting genomic DNA
with a restriction enzyme, preferably a restriction enzyme which cuts relatively infrequently (for example
average fragment size 10-20 kb) such as XbaI, KpnI or BamHI. The fragments may then be self ligated to
form circles. The circles may then be cut with a restriction enzyme, preferably a restriction enzyme which
cuts frequently (for example HinfI) and the fragments ligated to a corresponding (for example HinfI)
vectorette. Thus for example if sequence adjacent to a known XbaI site is available, sequence adjacent to
the next possible distant XbaI site can be obtained by analysis of the vectorette product. In this way, a
series of jumps may be made providing additional start points for the method. It will be appreciated that the
ability to make such jumps represents a considerable advantage over PCR techniques as previously
described in so far as it may be possible to obtain amplified vectorette products at a considerable distance
from the site of the initiating primer. Thus the method is not limited by the difficulty encountered in routine
use of obtaining standard PCR products of more than about 5 kb.

In order that the present invention may be more fully understood it is described hereinafter, by way of
example, with reference to the accompanying drawings, in which:-

Figure 1(a) illustrates a double stranded target nucleic acid, Figure 1(b) illustrates the restricted target
nucleic acid, Figure 1(c) illustrates the target nucleic acid fragment/vectorette units obtained by ligation of
the target nucleic acid restriction fragments and the vectorette, and Figure 1(d) illustrates hybridisation of (i)
the initiating primer and (ii) vectorette primer with the target nucleic acid fragment/vectorette units.

Figure 2 illustrates one embodiment of the invention; Figure 2(a) depicting visualisation of restriction
fragments obtained by partial digestion of genomic DNA and subjected to gel electrophoresis. Figure 2(b)
depicts the mixture of different types of high molecular weight fragments size selected for further
processing. Figure 2(c) depicts the mixture of products obtained following ligation to form target nucleic
acid fragment/vectorette units.

Figure 3 illustrates one embodiment of the invention which seeks to increase the relative
concentration of restriction fragments capable of hybridising to the initiating primer.

Figure 4 illustrates a further embodiment of the invention which avoids the use of S1 nuclease and
which seeks to increase the relative concentration of restriction fragments capable of hybridising to both the
initiating primer and vectorette primer. Figure 4(a) depicts the mixture of products which might be
obtained following treatment of the restriction fragments produced by complete digestion of the target DNA
sequence of interest with blocking vectorette in the presence of DNA ligase. Figure 4(b) depicts a blocking
vectorette for use in the present invention after digestion with the restriction enzyme EcoRI and Figure 4(c)
depicts the products obtained by complete digestion of the mixture of products depicted in Figure 4(a).

Figure 5 shows two different vectorette portions for use in the method of the present invention;
vectorette portion (i) is a blocking double stranded vectorette having a short top (5') strand having a 3'
terminal group capable of blocking polymerisation and a bottom (3') strand forming a tail, the 3' terminal
group of the top (5') strand being capable of blocking polymerisation in the presence of nucleoside
triphosphates, and an agent for polymerisation of the nucleoside triphosphates, under hybridising
conditions; vectorette portion (ii) is double stranded but having a degree of non-complementarity between the
strands.

Figure 6 shows a schematic representation of the application of the present invention to the
vectorette portions of Figure 5.

Figure 7 shows a schematic representation of the use of vectorette libraries.

Figure 8 shows the schematic application of the present invention to amplification of circularised
target nucleic acid fragments.

Figure 9 depicts amplification of part of the α-1 antitrypsin gene using oligonucleotides 58 and 61
(Step 6 in Method I) and an XbaI vectorette unit/library.

Figure 10 shows the restriction map of the plasmid B3 (pUC8 with a 440 bp insert in the EcoRI site).

Figure 11 (i) and (ii) are photographs of an agarose gel showing the amplified fragments obtained in
Examples 1, 5 and 7 hereinafter.

Figure 12 shows a sequence from the phenylalanine hydroxylase gene flanking exon 9 and the
relative positions of binding for the oligonucleotides 58, 63, 66 and 59.

Figure 13 shows a portion of a photograph of an agarose gel revealing the amplified product obtained
according to Example 8.

Figure 20 shows a photograph of an agarose gel revealing the amplified products obtained according
to Example 15.

Figure 21 shows an autoradiograph of a sequencing gel showing the sequence of the products
generated in Example 16.

Figure 22 shows the nucleotide sequence data generated from the products described in Example
16.

Figure 23 shows a section of the phenylalanine hydroxylase gene flanking exon 1 and indicates the
relative positions of the oligonucleotides 58, 62 and 67 as well as the EcoRI restriction site used in
vectorette unit construction.

Figure 24

(a) is a photograph of an agarose gel showing the primary amplification product obtained according
to Example 17;

(b) is a photograph of an agarose gel showing the secondary amplification products obtained
according to Example 17.

Figure 25 shows the nucleotide sequence data generated from the products described in Example
17. Figure 25(a) shows sequence read with oligonucleotide 62 (as hereinafter defined), Figure 25(b) shows
sequence read with oligonucleotide 67 (as hereinafter defined) and Figure 25(c) shows the overall sequence
read with oligonucleotides 62 and 67.

Figure 26 shows a section of the α-1 antitrypsin gene flanking exon V and showing the relative
positions of the oligonucleotides 58, 60, 68, 69 and 70 as well as the EcoRI restriction site for annealing and
ligation of the vectorette.

Figure 27

(a) is a photograph of an agarose gel showing the primary amplification product obtained according
to Example 18;

(b) is a photograph of an agarose gel showing the secondary amplification product obtained
according to Example 18;

(c) is an autoradiograph of a nylon filter, blotted from the gel shown in Figure 27(b) and probed
with an α-1 antitrypsin probe.

Figure 28 shows the nucleotide sequence of 05, an anonymous fragment on chromosome 7 of the
human genome, indicating the relative positions of oligonucleotides 71 and 72.

Figure 29 shows a restriction map of the region containing 05. The relative positions of
oligonucleotides 71 and 72 are indicated. The positions of the products synthesised according to Example
19 are also shown.

Figure 30 is a photograph of an agarose gel showing the amplified products obtained according to
Example 19.

Figure 31 shows the nucleotide sequence generated from the Bcl I product obtained as described in
Example 19. The position of the Hind III and Hae III sites predicted from restriction enzyme digests is
shown. The position of oligonucleotide 73 which would be used as an initiating primer in subsequent cycles
of vectorette unit amplification is also indicated.

Figure 32 is a photograph of an agarose gel showing the amplified products obtained according to
Example 20.

Figure 33 shows the nucleotide sequence generated from the EcoRI product obtained as described
in Example 20.

Figure 34 is a photograph of an agarose gel showing the fragments obtained after digestion with
restriction enzymes of the EcoRI product obtained as described in Example 20.

Figure 35 is a photograph of an agarose gel showing the amplified products obtained according to
Example 21.

Figure 36

(a) shows a restriction map of a section of Hind III cosmid vectorette library, indicating the positions
of oligonucleotides 58, 76 and 77.

(b) shows a photograph of an agarose gel showing the amplified products obtained according to
Example 22.

Figure 37

(a) shows a restriction map of a segment of target genomic DNA ligated to an EcoRI vectorette.
The positions of oligonucleotides 27, 76, 78 and 79 are indicated.

(b) shows a restriction map of a segment of target genomic DNA ligated to an EcoRI vectorette.
The positions of oligonucleotides 78, 79 and 84 are indicated.

(c) shows a segment of the Phenylalanine Hydroxylase gene exon 9 ligated to an EcoRI
vectorette. The positions of oligonucleotides 63 and 79 are indicated.

(d) shows a restriction map of the segment of target genomic DNA (described in 37(c) above)
ligated to a modified EcoRI vectorette. The positions of oligonucleotides 76, 79, 80 and 81 are indicated.

(e) shows a restriction map of a segment of target genomic DNA (described in 37(b) above) ligated
to a modified EcoRI vectorette. The positions of oligonucleotides 79, 80, 81 and 84 are indicated.

(f) shows a segment of the Phenylalanine Hydroxylase gene exon 9 ligated to a modified EcoRI
vectorette. The positions of oligonucleotides 63, 79, 80 and 81 are indicated.

(g) shows a restriction map of a segment of target genomic DNA ligated to a modified Hind III
vectorette. The positions of oligonucleotides 76, 78 and 79 are indicated.

(h) shows a restriction map of a segment of target genomic DNA ligated to a modified Hind III
vectorette. The positions of oligonucleotides 76, 79, 81 and 82 are indicated.

(i) shows a restriction map of a segment of target genomic DNA ligated to a modified Hind III
vectorette. The positions of oligonucleotides 76, 79, 81 and 83 are indicated.

Figure 38 shows a photograph of an agarose gel revealing the amplified products obtained according
to Examples 23 (a), (b), (c), (d), (e), (f), (g), (h) and (i).

Figure 39

(a) shows a restriction map of the target genomic DNA sequence containing the polymorphic Pst I
site detected by the probe KM19. The polymorphic Pst I site is indicated by an asterisk.

(b) shows the formation of circular products by ligation of Pst I cleaved target genomic DNA
fragments.

(c) shows the construction of Hind III vectorette unit libraries by ligation of oligonucleotides to Pst I
circles cleaved by Hind III.

Figure 40 shows a photograph of an agarose gel revealing the amplified products
obtained according to Example 24.

Figure 41 shows a photograph of an agarose gel obtained by taking a sample from lane 3 of
Figure 40, precipitation and loading onto a fresh agarose gel.

Figure 42 shows a photograph of an agarose gel revealing the amplified products obtained according
to Example 25.

Figure 1(a) shows schematically a double stranded target nucleic acid of for example 100 kb or more,
the sequence of which is unknown except for the initial 5' nucleotides as shown hatched, for example about
20 nucleotides. A locus of interest, X, of unknown sequence is shown at an unknown distance from the
5'-end. The target nucleic acid is subjected to restriction (see Figure 1(b)) using a 6 bp cutter such as EcoRI
which is expected to cleave a nucleic acid, on average, every 4096 bp. The recognition sequence for EcoRI
is G↓AATTC, the cleavage point being indicated by the arrow. EcoRI therefore cleaves the target nucleic acid
to yield cohesive ends. The cohesive ends of the target nucleic acid restriction fragments are then ligated to
an appropriate vectorette as shown in Figure 1(c), the vectorette being hatched. The vectorette may be of
any convenient nucleotide sequence as described above but will have at its 5' end a nucleotide sequence
pattern consistent with ensuring ligation with the cohesive ends of the restriction fragments referred to
above. Thus if EcoRI is used as the restriction endonuclease the 3'-end of the restriction fragments and the
5' end of the vectorette will have the sequences

    Restriction   5' - G          AATTC - 3'   Vectorette
    fragments     3' - CTTAA          G - 5'

Preferably that portion of the restriction endonuclease recognition pattern which does not take part in
the ligation is altered in the vectorette to destroy the recognition pattern. Thus any restriction endonuclease
present for the purpose of cleaving the target nucleic acid is unable to cleave the restriction
fragment/vectorette units once formed. Where the restriction endonuclease used is EcoRI therefore the
5' terminal end of the vectorette is preferably

    5' AATTA 3' or 5' AATTT 3' or 5' AATTG 3'

the CG portion of the recognition pattern being changed to AT or the CG being transposed to that in the
EcoRI recognition pattern. The restriction fragment/vectorette units are then treated together or preferably
sequentially (with the initiating primer added first) with initiating primer and vectorette primer under
hybridising conditions (see Figure 1(d)). The initiating primer will only be capable of hybridising to that
portion of the one target restriction fragment containing a known sequence (see Figure 1(d)(i)). The
vectorette primer is designed to hybridise to the nucleic acid strand which is formed by extension of the
initiating primer (and as shown in Figure 1(d) the nucleic acid strand which is complementary to the nucleic
acid strand to which the initiating primer is capable of hybridising). The vectorette primer will hybridise to
the vectorette priming region of each restriction fragment/vectorette unit present (see Figure 1(d)(ii)), but in
the presence of appropriate nucleoside triphosphates and an agent for the polymerisation of the nucleoside
triphosphates, amplification will only take place in respect of the restriction fragment/vectorette unit to which
the initiating primer was hybridised then extended. It will be appreciated that preferably the hybridisation (ii)
will only occur in use after the hybridisation (i), extension in the presence of nucleoside triphosphates and
an agent for polymerisation of the nucleoside triphosphates and subsequent denaturation. This prevents one
situation apparent from Figure 1(d) where the vectorette primer (ii) is seen to hybridise to and therefore be
capable of forming an extension product with every target nucleic acid fragment/vectorette unit present in
the complex mixture. It is thus preferable to treat the vectorette unit first with the initiating primer and then,
after the steps detailed above, to add the vectorette primer.
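
The effect of altering the vectorette end on the recognition pattern can be checked with a short sketch. It assumes, for illustration, that the top strand of every restriction fragment ends in G after EcoRI cleavage and that only the top strand needs to be inspected; the function names are hypothetical and not part of the specification.

```python
ECORI_SITE = "GAATTC"


def junction(vectorette_end: str) -> str:
    """Top-strand sequence formed when a fragment ending in G is ligated
    to a vectorette whose 5' end is `vectorette_end`."""
    return "G" + vectorette_end


def is_recleavable(vectorette_end: str) -> bool:
    """True if the ligated junction regenerates the EcoRI recognition site."""
    return ECORI_SITE in junction(vectorette_end)


# Unaltered end first, then two of the altered ends named in the text.
for end in ("AATTC", "AATTA", "AATTT"):
    status = "recleavable" if is_recleavable(end) else "protected"
    print(end, "->", junction(end), status)
```

Only the unaltered end regenerates GAATTC at the junction; the altered ends ligate through the same AATT overhang but leave a junction that EcoRI no longer recognises.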

It will therefore be possible to identify the amplified target fragment and if desired sequence the whole
fragment, for example, by techniques described above. If desired, however, only the 3' end of the amplified
target restriction fragment need be sequenced so that a new initiating primer may be designed. This new
initiating primer may then be used to obtain amplification of a new target restriction fragment obtained by
digesting the target nucleic acid with a different restriction endonuclease, preferably also a 6 bp cutter, and
the method of the present invention repeated. In this way a target nucleic acid may be sequenced from the
5' end without the need for cloning and with the direction of sequencing constantly and consistently
proceeding in the 5' to 3' direction. By repeating the method of the present invention therefore it is possible
to rapidly reach and sequence the aforementioned locus, X, of interest.

Figure 2 illustrates one embodiment of the present invention. The DNA sequence of interest is
subjected to partial digestion for example with EcoRI and the restriction fragments obtained subjected to gel
electrophoresis as depicted in Figure 2(a). Electrophoresis proceeds in the direction indicated by the arrow
and thus high molecular weight restriction fragments appear towards the top of the gel whilst the lower
molecular weight restriction fragments appear towards the bottom of the gel. The X-axis represents an
increasing concentration of restriction enzyme or prolonged incubation. Restriction fragments of less than a
predetermined size, for example 10 kb, are discarded and the higher molecular weight restriction fragments
are taken through to the next stage. These higher molecular weight restriction fragments will constitute a
mixture of overlapping partial digestion fragments, as depicted in Figure 2(b), restriction sites being at a
restriction fragment terminus marked (i- ) or within the fragment, marked (v). All the restriction fragments
will have been cut using the same single restriction endonuclease, for example EcoRI, and thus each
fragment will have cohesive ends characteristic of the restriction endonuclease employed to effect the
digestion. Only the extreme 5' and 3' fragments will have a single such cohesive end whilst the remaining
fragments will each have two such cohesive ends. A vectorette as hereinbefore defined in which the
restriction endonuclease recognition sequence has been destroyed may then be mixed with the
above-mentioned target restriction fragments in the presence of a ligase to yield a mixture of products as
exemplified in Figure 2(c), the vectorette being shown as a terminal box. It will be appreciated that the
presence of ligase enables not only ligation of the target restriction fragments with the vectorette to be
effected, but also enables ligation of the target restriction fragments themselves. It will be appreciated
therefore that under the conditions of ligation in Figure 2 it is not unlikely that segments A and B in the
initial target nucleic acid fragment/vectorette units will not be adjacent in the original target genomic nucleic
acid but will have come together anomalously during ligation. The products of ligation are then subjected to
complete digestion with the restriction endonuclease employed to effect the partial digestion, for example
EcoRI. The products of complete digestion will in general be double stranded target restriction fragments
having a vectorette ligated at each end of the fragment and target restriction fragments having a vectorette
at only one end of the fragment. There will be a background population of target restriction fragments, for
example fragments A and B in Figure 2(c), which will not have a vectorette on either end. The former
products will constitute a minor proportion of the total mixture of products. On denaturation only a trace of
product vectorette units will be able to hybridise with the initiating primer, but a very substantial proportion
of the product mixture may be capable of hybridising with the vectorette primer unless substantial
modification of the vectorette is undertaken as described below. Amplification of the desired restriction
fragment will therefore be possible, but the substantial proportion of the vectorette unit library capable of
hybridising with the vectorette primer may result in the need to use substantial quantities of the vectorette
primer, of the agent for polymerisation of the nucleoside triphosphates (e.g. Taq DNA polymerase) and
the nucleoside triphosphates themselves. This difficulty may be obviated to some extent by the use of small
quantities of starting material, but it would be advantageous to selectively increase the relative proportion of
the target restriction fragment/vectorette units capable of hybridising to the initiating primer in the product
mixture.

The rationale for the approach to construction of a vectorette unit library illustrated in Figure 2 is as
follows. Total digestion with a 6 bp cutting enzyme will generate a population of fragments with a mean size
of 4096 bp. Ligation of vectorette units to this population (approximately 10^6 fragments from total human
genomic DNA) would be expected to generate many fragments with a vectorette unit at each end. Many of
these will be considerably smaller than 4096 bp given the Gaussian distribution of restriction sites. In use a
vectorette primer in these circumstances might act as both primers in a standard PCR. Large amounts of
spurious amplification products would be generated as well as any product arising from an initiating primer.
In the discussion hereinbefore reference has been made to methods in which the synthesis of vectorette
primer amplification products is dependent upon the initial synthesis of an extension product of the initiating
primer. In the Figure 2 embodiment the vectorette unit itself may hybridise with and initiate copying from
the vectorette primer. The above disadvantage of PCR product formation from target nucleic acid fragments
with vectorette portions at each end is overcome by a size fractionation step such that any such vectorette
unit constructs have the 2 identical vectorettes separated by such a distance (eg greater than 10 kb) that
standard PCR is so inefficient that no significant product is generated.
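
The arithmetic behind the figure of 4096 bp may be sketched as follows. This is an illustration only, assuming that the four bases occur with equal frequency and independently at each position (an idealisation that real genomic DNA does not satisfy exactly); the function names and the genome size constant are chosen for the example.

```python
# Idealised estimate of restriction fragment size and number.
# Assumption: equal, independent base frequencies (not true of real genomes).

HUMAN_GENOME_BP = 3e9  # approximate size of the human genome in base pairs


def mean_fragment_size(site_length: int) -> int:
    """Mean spacing (bp) between occurrences of one specific site of this length."""
    return 4 ** site_length


def expected_fragment_count(genome_size_bp: float, site_length: int) -> float:
    """Approximate number of fragments produced by total digestion."""
    return genome_size_bp / mean_fragment_size(site_length)


print(mean_fragment_size(6))                               # 6 bp cutter
print(round(expected_fragment_count(HUMAN_GENOME_BP, 6)))  # fragments per genome
```

Under these assumptions a 6 bp cutter gives roughly 7 x 10^5 fragments from total human genomic DNA, that is, of the order of a million.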

Figure 3 illustrates one embodiment which seeks to increase the relative concentration of restriction
fragments capable of hybridising to the initiating primer in the vectorette unit product mixture obtained as
described above in relation to Figure 2. In Figure 3 (a), (i) is a restriction fragment containing an
initiating priming region (1) and has a vectorette portion (hatched) (2), (ii) is a restriction fragment
having a vectorette portion (hatched) (2) ligated at each end, but no initiating priming region and (iii) is
a restriction fragment having no initiating priming region, but having a single vectorette portion (hatched)
(2). In this regard the product mixture is treated, after denaturation (see Figure 3b), with initiating
primer only (3), in the presence of an agent for polymerising nucleoside triphosphates, such as Taq DNA
polymerase, and nucleoside triphosphates. No vectorette primer is employed at this stage. The target
restriction fragment/vectorette units capable of hybridising to the initiating primer are thus selectively
replicated (see Figure 3 (c)), no replication of fragments without the necessary initiating priming region
being possible. The product mixture thus obtained is then subjected to treatment with a single-strand
specific endonuclease such as S1 nuclease (see Figure 3 (d)). This endonuclease breaks the internal
phosphodiester bonds in single stranded DNA and thus results in the production of blunt-ended double stranded
DNA. This process increases the relative concentration of the target restriction fragment/vectorette units
capable of hybridising to the initiating primer over the remaining background fragments. Moreover if desired
this process step of denaturation, treatment with initiating primer only in the presence of an agent for
polymerising nucleoside triphosphates and nucleoside triphosphates followed by treatment of the product
mixture obtained with a single-strand specific endonuclease such as S1 nuclease may be repeated as many
times as thought desirable or necessary to increase the relative concentration of the target restriction
fragment/vectorette units capable of hybridising to the initiating primer. When the concentration of such
vectorette units is considered to be sufficiently high in relation to the remaining background fragments the
product mixture may be additionally treated with vectorette primer to effect amplification as described in
US Patents Nos 4,683,195 and 4,683,202.

A disadvantage of the above-proposed technique is that S1 nuclease tends not to be as single-strand
specific as might be desirable and thus even some double stranded DNA may be degraded.
Furthermore linear amplification would not be appropriate in relation to the technique of Figure 3 since
only a single round of initiating primer extension would yield S1 nuclease resistant double stranded DNA.

Figure 4 illustrates a further embodiment of the present invention which avoids the use of S1 nuclease
and which seeks to increase the relative concentration of restriction fragments capable of hybridising to the
initiating primer in the product mixture obtained by complete restriction endonuclease digestion of the target
DNA sequence of interest. The advantage of this embodiment is that no fractionation of the fragments
obtained is necessary nor is gel electrophoresis required as an initial step. Following complete digestion of
the DNA sequence of interest with a restriction endonuclease such as EcoRI the fragments obtained are
treated with a blocking vectorette (as hereinbefore defined) in the presence of a DNA ligase to yield a
complete mixture of products as depicted in Figure 4(a). It will be appreciated that the fragments A and B
will not necessarily be adjacent in the original DNA sequence but may have been joined anomalously in the
DNA ligase reaction. The blocking vectorette consists of a double stranded oligonucleotide sequence as
depicted in Figure 4(b) having a first cohesive end at one end thereof adapted for ligation to the
corresponding cohesive ends of the restriction fragments, but in which that part of the restriction
endonuclease recognition site which does not take part in the ligation is altered in the vectorette to destroy
the recognition site such that the restriction fragment/vectorette unit once formed cannot be cleaved by the
restriction endonuclease which gave rise to the restriction fragments. The other end of the blocking
vectorette comprises a terminus having a blocking residue which might preferably be a 3'-dideoxynucleoside
such as dideoxyadenosine (ddA), or any other 3' residue such that primer extension cannot
proceed there, for example a 3'-deoxynucleoside or other chemical modification known per se. The lower
strand overlaps the upper strand as shown in Figure 4(b). This strand preferably terminates in a
non-phosphorylated residue to avoid problems of self ligation. The region (4) possesses the same sequence as
the vectorette primer such that the vectorette primer is able to hybridise with the extension product of an
initiating primer but not with the unextendable upper strand of the 3'-modified blocking vectorette. The
blocking vectorette is conveniently prepared as two separate strands which are subsequently hybridised.
The first and second strands are each prepared separately, for example manually, or conveniently using a
DNA synthesiser, the first strand being 5' phosphorylated by chemical means or, conveniently, enzymatically,
for example by the use of polynucleotide kinase, and a terminal transferase being conveniently used to
introduce a 3'-terminal dideoxynucleotide or other modified nucleotides. Alternatively other blocking
moieties (for example amino) may be introduced chemically at the 3' end by methods known per se. The first
and second strands thus prepared are then allowed to hybridise to form the double-stranded blocking
vectorette.

The mixture of products depicted in Figure 4(a) and obtained following treatment of the restriction
fragments with blocking vectorette in the presence of a DNA ligase is subjected to complete digestion with
the same restriction endonuclease used to create the restriction fragments prior to formation of the
restriction fragment/vectorette units whereby the types of products (i), (ii) and (iii) depicted in Figure
4(c) are obtained. In order to reduce the relative concentration of restriction fragments (iii) present in
the mixture of restriction fragment/vectorette units (i) and (ii) the above mentioned cycle may be repeated
as many times as thought necessary or desirable to form a pool of restriction fragment/vectorette units, for
example based on the use of the restriction endonuclease EcoRI.

Figure 5 shows in (i) a blocking vectorette comprising a 14 base top oligonucleotide and a 42 base
bottom oligonucleotide annealed together with the four base overhang (a 5' overhang) complementary to
the ends produced in the cleaved target DNA. N1, N2, N3, N4, N5 and N5' are as hereinafter defined in
Table 1. The 3' end of the 14 base oligonucleotide is modified (X) such that 5'-3' extension from that end
cannot be carried out using any of the known DNA polymerases, eg Klenow fragment, T7 DNA polymerase
(Sequenase) or Taq DNA polymerase. This modified base may be a dideoxynucleoside or a 3'-deoxynucleoside
(eg cordycepin). The 3' sugar residue may, if desired, be modified by any chemical methods
known per se.

The 3' terminal base of the bottom 42 mer and the complementary base at position 5 (from the 5' end)
in the 14 base top oligonucleotide are chosen such that following ligation of the blocking vectorette to the
cleaved target DNA the resulting hexanucleotide (at the ligation junction) is not recognised by the restriction
endonuclease used to cleave the genomic DNA originally. This is important when ensuring that every
cleaved genomic DNA fragment has a blocking vectorette ligated to its ends. Obviously, the ligation
products will contain cleaved genomic DNA fragments re-ligated to each other and these are digested again
in order to generate receiving ends for the blocking vectorette in a second ligation reaction. This restriction
and ligation cycle may need to be repeated more than once, for example three to four times, to ensure
every cleaved genomic DNA fragment has a blocking vectorette at either end. It is preferred that each
cleaved genomic DNA fragment has a blocking vectorette at either end so as to prevent random priming






EP 0 356 021 A2 



from any unblocked 3'-ends of genomic DNA fragments in the reaction mixture after denaturation and
treatment with an initiating primer.
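
The effect of the fifth base on the ligation junction can be checked mechanically. The following Python sketch is illustrative only: the helper names, and the recognition sequences and cut positions given for EcoRI, Hind III and BamHI (cleavage after the first base of the top strand), are assumptions supplied for the example; the oligonucleotide sequences are those of Type A oligonucleotides 2, 4 and 3 listed hereinafter.

```python
# Illustrative check that ligating a blocking vectorette to a cleaved end
# does not regenerate the restriction site. Helper names are hypothetical.

def junction_after_ligation(site: str, cut_offset: int, top_oligo: str) -> str:
    """Return the hexanucleotide formed at the ligation junction.

    site       -- recognition sequence of the enzyme, read 5' to 3'
    cut_offset -- number of site bases left on the genomic top strand
                  after cleavage (1 for G^AATTC)
    top_oligo  -- vectorette top strand, 5' overhang first
    """
    genomic_part = site[:cut_offset]
    vectorette_part = top_oligo[: len(site) - cut_offset]
    return genomic_part + vectorette_part


# enzyme -> (recognition site, cut offset, Type A oligonucleotide)
CASES = {
    "EcoRI": ("GAATTC", 1, "AATTGAAGGAGAGG"),     # oligonucleotide 2
    "Hind III": ("AAGCTT", 1, "AGCTGAAGGAGAGG"),  # oligonucleotide 4
    "BamHI": ("GGATCC", 1, "GATCGAAGGAGAGG"),     # oligonucleotide 3
}


def site_destroyed(enzyme: str) -> bool:
    """True if the junction can no longer be cut by the same enzyme."""
    site, offset, oligo = CASES[enzyme]
    return junction_after_ligation(site, offset, oligo) != site


for name, (site, offset, oligo) in CASES.items():
    print(name, site, "->", junction_after_ligation(site, offset, oligo))
```

By contrast, two genomic fragments re-ligated to each other regenerate the intact site, which is why the repeated digestion removes them while leaving fragment/vectorette junctions untouched.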

The blocking vectorette shown in Figure 5 (i) is suitable for target genomic DNA cleaved with restriction
endonucleases that produce 5' overhangs. For restriction endonucleases that produce a 3' overhang the
blocking vectorette would have to be modified slightly. Thus for example the 14 base top oligonucleotide
might be replaced by a 10 base top strand (missing the 4 base overhang at the 5' end) and the 42 base
bottom oligonucleotide strand might be replaced by a 46 base bottom strand, with the extra 4 bases being
at the 3' end and being complementary to the ends produced by the restriction endonuclease used to
cleave the genomic target DNA.
Figure 5 (ii) shows a non-complementary vectorette portion consisting of two annealed single stranded
sequences (a 57 base and a 53 base oligonucleotide) which possess a degree of non-complementarity such
that vectorette primer extension cannot take place immediately. The discussion above in relation to the
blocking vectorette of Figure 5 (i) applies exactly to the non-complementary vectorette portion with the
exception that the non-complementary vectorette portion need not possess any modified nucleosides.

Figure 6 shows a schematic representation of the application of the present invention to the vectorette
portions of Figure 5 (i). Genomic target DNA is digested to completion with a single restriction
endonuclease to produce fragments as shown in (i). The blocking vectorette portions (5 in Figure 5(i)) are
ligated onto the ends of every cleaved genomic target DNA fragment (ii) in the presence of DNA ligase.
Amplification is then carried out after denaturation using the primers of known nucleotide sequence x and y
in the presence of for example Taq polymerase.

Thus in Figure 6 (iii), x represents the initiating primer and possesses the same or at least substantially
the same sequence as the region marked IP (initiating primer) in strand 1. x is capable of hybridising to the
initiating priming region (IPR) in strand 2. Strand 2 which contains the initiating priming region (IPR) also
contains a portion of nucleotide sequence which is the same or at least substantially the same as the
nucleotide sequence of the vectorette primer (VP). In use, primer extension of x yields a strand which
contains a vectorette priming region (VPR) suitable for hybridisation with the vectorette primer (VP)
represented by the primer y. Since strand 1 does not contain a vectorette priming region itself and cannot
be subject to primer extension because of the presence of the polymerisation blocking moiety and strand 2
also has no vectorette priming region, no primer extension of a vectorette primer can take place until a
vectorette priming region is created by primer extension of the initiating primer.

It will therefore be appreciated that in the first amplification cycle only the primer x can produce an
extension product extending up to the end of the 42 base oligonucleotide of the blocking vectorette (iii).
Primer y is redundant in the first cycle since there is no complementary sequence for it to hybridise to and
produce extension products. The sequence of primer y is exactly the same as bases 2 to 31 from the 5'
end of the 42 base bottom oligonucleotide in Figure 5 (i) and bases 13 to 42 inclusive of the 53 base
bottom oligonucleotide in Figure 5 (ii). In the second cycle and thereafter (v and iv) the primer y can
hybridise to the extension product of primer x and complementary synthesis of this extended product (iv)
may proceed. Thus primer y cannot hybridise to any of the ligation products shown in (iii). It can only
hybridise to an essentially completely extended product of primer x which is the known nucleotide
sequence in the locus of interest. Thus only an amplification product containing primer x, unknown
nucleotide sequence z and primer y is produced. The nucleotide sequence of z can be determined from
both ends by using primers x and y as sequencing primers. Alternatively "nested" sequencing primers x'
and y' may be prepared. These will contain sequence 3' of primer x or 3' of primer y, for example bases 25
to 42 from the 5' end of the 42 mer oligonucleotide in Figure 5(i).
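
The relationship between primer y and the two bottom oligonucleotides can be verified directly from the sequences given hereinafter for oligonucleotide 14 (42 mer) and oligonucleotide 40 (53 mer). The short Python sketch below assumes that the two ranges are to be read as inclusive (bases 2 to 31 and bases 13 to 42 respectively); the helper `bases` is a name chosen for the example.

```python
# Sketch: derive primer y from the bottom strands of Figure 5 (i) and 5 (ii).
# The inclusive reading of the base ranges is an assumption of this example.

OLIGO_14 = "CGAATCGTAACCGTTCGTACGAGAATCGCTGTCCTCTCCTTC"             # 42 mer
OLIGO_40 = "CTCTCCCTTCTCGAATCGTAACCGTTCGTACGAGAATCGCTGTCCTCTCCTTC"  # 53 mer


def bases(seq: str, first: int, last: int) -> str:
    """Return bases first..last of seq, numbered from 1 at the 5' end, inclusive."""
    return seq[first - 1:last]


primer_y_from_42mer = bases(OLIGO_14, 2, 31)
primer_y_from_53mer = bases(OLIGO_40, 13, 42)

print(primer_y_from_42mer)
print(len(primer_y_from_42mer), primer_y_from_42mer == primer_y_from_53mer)
```

Both slices give the same 30 base sequence, so a single vectorette primer serves both the blocking vectorette and the non-complementary vectorette.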

Figure 6 illustrates the method of the present invention by reference to a target nucleic acid
fragment/vectorette unit in which the vectorette portion contains a polymerisation blocking moiety. Similar
considerations apply where the vectorette portion comprises at least a region of non-complementarity as
hereinbefore described and as illustrated by way of example in Figure 5 (ii). Thus strand 1 will possess the
same or at least substantially the same sequence as the region marked IP. An initiating primer (IP) will be
capable of hybridising to the initiating priming region (IPR) in strand 2. Strand 2 which contains the IPR also
contains a portion of nucleotide sequence which is the same or at least substantially the same as the
nucleotide sequence of the vectorette primer (VP), but this portion of strand 2 does not hybridise to the
corresponding portion of strand 1 because of the degree of non-complementarity deliberately introduced
into strand 1. Thus the vectorette primer (VP) cannot hybridise to strand 1 since strand 1 contains no
vectorette priming region (VPR), nor can it hybridise to strand 2 because strand 2 contains a portion of
sequence which is the same as VP, but no complementary sequence or VPR. Thus no vectorette primer extension
products can be formed until after a vectorette priming region (VPR) has been created by primer extension
of the initiating primer (IP).
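
The argument above can be illustrated with the vectorette strands themselves. The Python sketch below is a simplification under stated assumptions: hybridisation is modelled as an exact match to the reverse complement of the primer (real annealing tolerates some mismatch), only the vectorette portions are considered rather than whole fragments, and the function names are chosen for the example. Strand 1 is taken as oligonucleotide 26 and strand 2 as oligonucleotide 40, both listed hereinafter.

```python
# Idealised model: a primer can anneal to a template only if the template
# contains the exact reverse complement of the primer (an assumption).

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.translate(COMPLEMENT)[::-1]


def can_prime(primer: str, template: str) -> bool:
    return reverse_complement(primer) in template


# Non-complementary vectorette portion (Figure 5 (ii))
STRAND_1 = "CTAGGAAGGAGAGGACGCTGTCTGTCGAAGGTAAGGAACGGAGGAGAGAAGGGAGAG"  # oligo 26
STRAND_2 = "CTCTCCCTTCTCGAATCGTAACCGTTCGTACGAGAATCGCTGTCCTCTCCTTC"      # oligo 40

# Vectorette primer: bases 13 to 42 of the 53 base bottom strand
VP = STRAND_2[12:42]

# Vectorette end of the strand made by extending the initiating primer
# along strand 2
IP_EXTENSION_PRODUCT = reverse_complement(STRAND_2)

print("VP anneals to strand 1:", can_prime(VP, STRAND_1))
print("VP anneals to strand 2:", can_prime(VP, STRAND_2))
print("VP anneals to IP extension product:", can_prime(VP, IP_EXTENSION_PRODUCT))
```

Only the third case is true in this model: the vectorette priming region exists solely in the strand synthesised from the initiating primer.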
The above described process is then preferably repeated using a different restriction endonuclease and
different correspondingly designed blocking vectorettes to form a different pool of restriction
fragment/vectorette units constituting a different vectorette library.

This process may be repeated separately for any desired number of different restriction endonucleases
(for example 10 to 30, conveniently 15 to 25, advantageously about 20) and correspondingly designed
blocking vectorettes to form as many different libraries of restriction fragment/vectorette units. The
different libraries of restriction fragment/vectorette units may then be mixed to form a plurality of
libraries of such hybrids. The plurality of libraries may then be treated with an initiating primer under
hybridising conditions in the presence of nucleoside triphosphates and an agent for the polymerisation of
the nucleoside triphosphates such as Taq DNA polymerase. The addition of vectorette primer under the
conditions appropriate for polymerase chain reaction (PCR) as described in US Patents Nos 4,683,195 and
4,683,202 results in amplification of the loci of interest only and such amplified fragments may be
subjected to sequencing, thus enabling a new initiating primer to be designed and the above-mentioned
process to be repeated using the new initiating primer as a starting point for amplification and, if
desired, sequencing of a further region of interest, the procedure being repeated as desired.

Figure 7 illustrates the above by showing a schematic representation of the use of vectorette libraries.
Genomic DNA is digested to completion with the restriction endonuclease EcoRI. Vectorette portions with 4
base overhangs complementary to the 4 base overhangs produced by EcoRI cleavage are ligated onto the
ends of every cleaved genomic DNA fragment. This is called an EcoRI vectorette library. Similar libraries
are constructed using for example about 15 to 20 restriction endonucleases recognising hexanucleotide
sequences. When these libraries are pooled and subjected to amplification a range of PCR products may
be obtained, or they can more preferably be used individually and separately to produce a single PCR
product from each library.

Figure 7 (a) shows a hypothetical restriction map to the 3' side of a locus of interest of known
nucleotide sequence (w). Upon amplification of pooled vectorette libraries using primers x and y the PCR
products Z, Z1, Z2, Z3 and Z4 shown in (b) are produced. Electrophoresis of these PCR products on an
agarose gel would produce the band pattern shown in (c) where the PCR reactions are run separately on
each vectorette library rather than on a pooled plurality of libraries. The largest PCR product visualised on
the gel (Z4 in this instance) may be gel purified and its 3' end nucleotide sequence determined using
oligomer y or a "nested" oligomer y' as the sequencing primer. This nucleotide sequence, at the 3' end of
Z4, could then serve as or be used for the synthesis of a new initiating primer x in further walking steps
using this method.
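
The choice of product for the next walking step can be expressed as a small selection rule. The Python sketch below is a hedged illustration: the distances are invented for the example and do not come from Figure 7, the practical upper limit on PCR product length is an assumed parameter, and the contribution of the vectorette itself to product length is ignored.

```python
# Sketch of one walking step: among the single-library PCR products, pick
# the largest one that can still be amplified. All numbers are hypothetical.
from typing import Dict, Optional

# Hypothetical distance (bp) from primer x to the nearest downstream site
# for each enzyme, i.e. the approximate product size from that library.
NEAREST_SITE_BP: Dict[str, int] = {
    "EcoRI": 850,
    "Hind III": 2300,
    "BamHI": 400,
    "Xba I": 5200,
}

MAX_PRODUCT_BP = 3000  # assumed practical limit for efficient amplification


def library_for_next_step(sites: Dict[str, int], limit: int) -> Optional[str]:
    """Return the library giving the largest amplifiable product, if any."""
    amplifiable = {enzyme: size for enzyme, size in sites.items() if size <= limit}
    if not amplifiable:
        return None
    return max(amplifiable, key=amplifiable.get)


best = library_for_next_step(NEAREST_SITE_BP, MAX_PRODUCT_BP)
print(best, NEAREST_SITE_BP[best])
```

With these invented figures the Hind III library would be chosen, and the 3' end of its product would supply the sequence for the next initiating primer.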

Figure 8 shows the schematic application of the present invention to inverse PCR, inverse PCR having
been latterly described in Triglia T. et al, Nucleic Acids Research Vol 16 1988 page 8186 and by Ochman
H. et al, Genetics 120: 621-623 (November 1988).

Genomic DNA is digested with a restriction enzyme which preferably cuts relatively infrequently (eg.
average fragment size 10-20 kb) such as Xba I, Kpn I or BamHI. The fragments at low concentration are self
ligated to form circles. The circles are then cut with a restriction enzyme which preferably cuts frequently
(such as Hinf I) and the fragments are ligated to a corresponding vectorette, for example a Hinf I
vectorette. Thus for example if sequence adjacent to a known Xba I site is available, sequence adjacent to
the next Xba I site can be obtained by analysis of the vectorette unit product. In this way, a series of
jumps could be made providing additional start points for the method of the present invention. Whilst an
average fragment size of 10-20 kb is referred to above by way of example it would be preferable to cleave
the target nucleic acid to give even larger fragments, for example of about 100 kb. Such fragments may for
example be obtained by the use of restriction enzymes which cut genomic DNA very infrequently such as Not I,
Bss HII and Sal I.

Figure 11(i) is a photograph of an agarose gel showing in lanes 1 and 11 the marker φX174 cleaved
with Hae III, in lane 12 the marker λ/Hind III, in lane 2 a PCR control and in lane 9 the amplified fragment
obtained using oligonucleotides 58 and 61 in Example 1 and as depicted in Figure 9.

Figure 11(ii) is the upper portion of a photograph of an agarose gel showing the marker λ/Hind III in
lanes 1 and 13, the marker φX174 cleaved with Hae III in lanes 2 and 12, a PCR control in lane 3 and the
amplified fragment obtained using oligonucleotides 58 and 61 in Example 1 (see Figure 9) in lane 10.

Figure 11(iii) is the lower portion of a photograph of an agarose gel showing the marker λ/Hind III in
lanes 1 and 11, the marker φX174 cleaved with Hae III in lane 6, the amplified fragment obtained according
to Example 5 in lanes 2, 3, 4 and 5 and the amplified fragment obtained according to Example 7 in lanes 7,
8, 9 and 10.

The remaining Figures are discussed in detail in the Examples hereinafter.

The present invention will now be illustrated, but not limited, by reference to the following methods and
examples in which the following oligodeoxynucleotides detailed below were used, each nucleotide sequence
stated herein being read in the conventional 5' to 3' sense:-

Oligonucleotide Type A (a set of 14 base long oligonucleotides with 4 or 5 end bases at the 5' end varying
in sequence and thus suitable for use with 5' overhangs. The 4 end bases represent the 5' overhang whilst
the fifth end base is present to destroy the restriction enzyme recognition site).
Specifically used were:

Oligonucleotide 1

CTAGGAAGGAGAGG

(for use with DNA digested with the restriction endonucleases Xba I or Nhe I or Spe I)

Oligonucleotide 2

AATTGAAGGAGAGG

(for use with DNA digested with the restriction endonuclease EcoRI)

Oligonucleotide 3

GATCGAAGGAGAGG

(for use with DNA digested with the restriction endonucleases BamHI or Bgl II or Xho II or Bcl I)

Oligonucleotide 4

AGCTGAAGGAGAGG

(for use with DNA cleaved with the restriction endonuclease Hind III)

Oligonucleotide 5

TCGAGAAGGAGAGG

(for use with DNA cleaved with the restriction endonuclease Sal I)

Oligonucleotide 6

CGCGGAAGGAGAGG

(for use with DNA cleaved with the restriction endonuclease Mlu I or Bss HII)

Oligonucleotide 7

GTACGAAGGAGAGG

(for use with restriction endonuclease Asp718 cleaved DNA)

Oligonucleotide 8

TCGAGAAGGAGAGG

(for use with DNA cleaved with the restriction endonuclease Xho I)

Oligonucleotide 9

CATGCAAGGAGAGG

(for use with DNA cleaved with the restriction endonuclease Nco I)

Oligonucleotide 10

GGCCCAAGGAGAGG

(for use with DNA cleaved with the restriction endonuclease Not I or Eag I)

Oligonucleotide 11

CCGGTAAGGAGAGG

(for use with DNA cleaved with the restriction endonucleases BspM II or Xma I or Acc III)

Oligonucleotide 12

CATGTAAGGAGAGG

(for use with DNA digested with the restriction endonuclease Bsp HI or Nco I)

Oligonucleotide 13

CTAGTAAGGAGAGG

(for use with DNA digested with the restriction endonucleases Avr II, Nhe I or Xba I)
Oligonucleotide Type B (a set of 42 base long oligonucleotides with the 3' end base varying in sequence,
the oligonucleotides being suitable for use with 5' overhangs).
Specifically used were:-

Oligonucleotide 14

CGAATCGTAACCGTTCGTACGAGAATCGCTGTCCTCTCCTTC

This oligonucleotide is to be used with oligonucleotides 1, 2, 3, 4, 5, 6 and 7.

Oligonucleotide 15

CGAATCGTAACCGTTCGTACGAGAATCGCTGTCCTCTCCTTG

(The nucleotide sequence is the same as 14 except the 3' terminal residue C is replaced by G). This
oligonucleotide is to be used with oligonucleotides 8, 9 and 10.

Oligonucleotide 16

CGAATCGTAACCGTTCGTACGAGAATCGCTGTCCTCTCCTTT

(The nucleotide sequence is the same as 14 except the 3' terminal residue C is replaced by T).

Oligonucleotide 17

CGAATCGTAACCGTTCGTACGAGAATCGCTGTCCTCTCCTTA

(The nucleotide sequence is the same as 14 except the 3' terminal residue C is replaced by A). This
oligonucleotide is to be used with oligonucleotides 11, 12, 13 and 18 (defined hereinafter).

Oligonucleotide 18 (10 mer)

TAAGGAGAGG

(Oligonucleotide 18 is annealed to oligonucleotide 17. They are designed for use as a vectorette unit with
DNA digested with a restriction endonuclease capable of creating blunt ends)
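
The pairing rules between the Type A and Type B oligonucleotides follow from base complementarity at the fifth position. The Python sketch below is an illustration under the assumption of perfect Watson-Crick pairing over the duplex region; the function names are chosen for the example and the sequences are those of oligonucleotides 2, 13, 14, 17 and 18 above.

```python
# Sketch: check that a top strand, minus its 5' overhang, is exactly
# complementary to the 3' end of the bottom strand it is paired with.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.translate(COMPLEMENT)[::-1]


def anneals(top: str, bottom: str, overhang: int = 4) -> bool:
    """True if top[overhang:] pairs perfectly with the 3' end of bottom."""
    duplex_part = top[overhang:]
    return bottom.endswith(reverse_complement(duplex_part))


OLIGO_2 = "AATTGAAGGAGAGG"    # Type A, EcoRI
OLIGO_13 = "CTAGTAAGGAGAGG"   # Type A, Avr II, Nhe I or Xba I
OLIGO_18 = "TAAGGAGAGG"       # 10 mer for blunt ends (no overhang)
OLIGO_14 = "CGAATCGTAACCGTTCGTACGAGAATCGCTGTCCTCTCCTTC"  # Type B, 3' C
OLIGO_17 = "CGAATCGTAACCGTTCGTACGAGAATCGCTGTCCTCTCCTTA"  # Type B, 3' A

print(anneals(OLIGO_2, OLIGO_14))               # stated pairing
print(anneals(OLIGO_13, OLIGO_17))              # stated pairing
print(anneals(OLIGO_18, OLIGO_17, overhang=0))  # blunt-end vectorette
print(anneals(OLIGO_2, OLIGO_17))               # mismatched at the fifth base
```

The first three pairings are fully complementary, while the last fails at the 3' terminal base of the bottom strand, which is the position that distinguishes the Type B oligonucleotides from one another.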
Oligonucleotide 19 and the Type C series of oligonucleotides are suitable for use with 3' overhangs:

Oligonucleotide 19 (10 mer)

5'-AAA GGA GAG G-3'

This oligonucleotide is designed for use with oligonucleotides 20 to 25 as hereinafter defined.
Oligonucleotide Type C (a set of 46 base long oligonucleotides)

Oligonucleotide 20

5'-CGA ATC GTA ACC GTT CGT ACG AGA ATC GCT GTC CTC TCC TTT TGC A-3'
(for use with DNA digested with the restriction endonucleases Nsi I or Pst I)

Oligonucleotide 21

5'-CGA ATC GTA ACC GTT CGT ACG AGA ATC GCT GTC CTC TCC TTT AGC T-3'
(for use with DNA digested with the restriction endonucleases Sst I or Sac I)

Oligonucleotide 22

5'-CGA ATC GTA ACC GTT CGT AGC AGA ATC GCT GTC CTC TCC TTT CAT G-3'
(for use with DNA digested with the restriction endonuclease Sph I)

Oligonucleotide 23

5'-CGA ATC GTA ACC GTT CGT ACG AGA ATC GCT GTC CTC TCC TTT GTA C-3'

(for use with DNA digested with the restriction endonuclease Kpn I)

Oligonucleotide 24

5'-CGA ATC GTA ACC GTT CGT ACG AGA ATC GCT GTC CTC TCC TTT ACG T-3'
(for use with DNA digested with the restriction endonuclease Aat II)

Oligonucleotide 25

5'-CGA ATC GTA ACC GTT CGT ACG AGA ATC GCT GTC CTC TCC TTT GGC C-3'
(for use with DNA digested with the restriction endonuclease Apa I)

Oligonucleotide Type D

This is a set of 57 base long oligonucleotides with 4 to 5 bases at the 5' end only varying in sequence.

Oligonucleotide 26

CTAGGAAGGAGAGGACGCTGTCTGTCGAAGGTAAGGAACGGAGGAGAGAAGGGAGAG

To be used with oligonucleotide 40 (defined hereinafter) and DNA digested with Xba I, Nhe I or Spe I.

Oligonucleotide 27

5'-AAT TGA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG-3'

To be used with oligonucleotide 40 and DNA cleaved with EcoRI.

Oligonucleotide 28

5'-TCG AGA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG-3'
To be used with oligonucleotide 40 and DNA cleaved with Sal I.

Oligonucleotide 29

5'-CGC GGA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG-3'
To be used with oligonucleotide 40 and DNA cleaved with Bss HII or Mlu I.

Oligonucleotide 30

5'-AGC TGA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG-3'
To be used with oligonucleotide 40 and DNA cleaved with Hind III.

Oligonucleotide 31

5'-GAT CGA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG-3'
To be used with oligonucleotide 40 and DNA cleaved with Bam HI, Bcl I or Bgl II.

Oligonucleotide 32

5'-CCG GGA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG-3'
To be used with oligonucleotide 40 and DNA cleaved with Acc III or BspM II.

Oligonucleotide 33

5'-TGC AGA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG-3'
To be used with oligonucleotide 40 and DNA cleaved with Apa LI.

Oligonucleotide 34

5'-TCG ATA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG-3'
To be used with oligonucleotide 41 and DNA cleaved with Xho I.

Oligonucleotide 35

5'-GGC CTA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG-3'
To be used with oligonucleotide 41 and DNA cleaved with Eag I or Not I or Xma III.

Oligonucleotide 36

5'-CCG GTA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG-3'
To be used with oligonucleotide 41 and DNA cleaved with BspM II or Xma I or Acc III.

Oligonucleotide 37

5'-CAT GTA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG-3'
To be used with oligonucleotide 41 and DNA cleaved with Bsp HI or Nco I.

Oligonucleotide 38

5'-CTA GTA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG-3'
To be used with oligonucleotide 41 and DNA cleaved with Avr II, Nhe I or Xba I.

Oligonucleotide 39

5'-TA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG-3'
To be used with oligonucleotide 41 and DNA cleaved with a restriction endonuclease capable of creating
blunt ends.

Oligonucleotide Type E

This is a set of 53 base long oligonucleotides with a varying 3' terminal residue.

Oligonucleotide 40

5'-CTC TCC CTT CTC GAA TCG TAA CCG TTC GTA CGA GAA TCG CTG TCC TCT CCT TC-3'
To be used with oligonucleotides 26 to 33.

Oligonucleotide 41

5'-CTC TCC CTT CTC GAA TCG TAA CCG TTC GTA CGA GAA TCG CTG TCC TCT CCT TA-3'
To be used with oligonucleotides 34 to 39.
Oligonucleotide Type F

This is a 53 base long oligonucleotide for use with the set of oligonucleotide Type G (43 to 48) as
hereinafter defined and is for use with 3' overhangs.

Oligonucleotide 42

5'-AA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG-3'

Oligonucleotide Type G

This is a set of 57 base long oligonucleotides with 4 bases at the 3' end varying in sequence.

Oligonucleotide 43

5'-CTC TCC CTT CTC GAA TCG TAA CCG TTC GTA CGA GAA TCG CTG TCC TCT CCT TTT GCA-3'
To be used with oligonucleotide 42 and DNA cleaved with Pst I or Nsi I.

Oligonucleotide 44

5'-CTC TCC CTT CTC GAA TCG TAA CCG TTC GTA CGA GAA TCG CTG TCC TCT CCT TTA GCT-3'
To be used with oligonucleotide 42 and DNA cleaved with Sst I or Sac I.

Oligonucleotide 45

5'-CTC TCC CTT CTC GAA TCG TAA CCG TTC GTA CGA GAA TCG CTG TCC TCT CCT TTC ATG-3'
To be used with oligonucleotide 42 and DNA cleaved with Sph I.

Oligonucleotide 46

5'-CTC TCC CTT CTC GAA TCG TAA CCG TTC GTA CGA GAA TCG CTG TCC TCT CCT TTG TAC-3'
To be used with oligonucleotide 42 and DNA cleaved with Kpn I.

Oligonucleotide 47

5'-CTC TCC CTT CTC GAA TCG TAA CCG TTC GTA CGA GAA TCG CTG TCC TCT CCT TTA CGT-3'
To be used with oligonucleotide 42 and DNA cleaved with Aat II.

Oligonucleotide 48

5'-CTC TCC CTT CTC GAA TCG TAA CCG TTC GTA CGA GAA TCG CTG TCC TCT CCT TTG GCC-3'
To be used with oligonucleotide 42 and DNA cleaved with Apa I.

The restriction endonucleases indicated are to be used for cleaving the DNA for use with that particular
oligonucleotide or pair of oligonucleotides. Thus oligonucleotide 1 is to be used with for example genomic
DNA cleaved with Nhe I or Spe I or Xba I.

All the oligonucleotides for use with 5' overhangs (that is Type A, D and F (42)) and oligonucleotides 18,
19 and 39 are preferably phosphorylated.

Oligonucleotides 49-57 are used in the construction of alternative vectorette units.



Oligonucleotide 49

5'-AAT TGA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA G-3'

Oligonucleotide 50

5'-CGA ATC GTA ACC GTT CGT ACG AGA ATC GCT GTC CTC TCC TTC-3'

Oligonucleotide 51

5'-AAT TGA AGG AGA GGA CGC TGT CAG AGG ACG GTT ACG AAC GTA GGA CAG AAG GGA GAG-3'

Oligonucleotide 52

5'-AAT TGA AGG AGA GGA CGC TGA CTG TCG AAC GTA CGG ATA GGA GTC GAG AAG GGA GTC GAG
AAG GGA GAG-3'

Oligonucleotide 53

5'-AAT TGA AGG AGA GGA GAG AAG GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA-3'

Oligonucleotide 54

5'-CGA ATC GTA ACC GTT CGT ACG AGA ATC GCT TCC CTT CTC TCC TCT CCT TC-3'

Oligonucleotide 55

5'-AAT TGA AGG AGA GGA CGC TGT CTG TCG AAG GTA AGG AAC GGA GGA GAG AAG GGA GAG
AAA GAG GAA GGG AAG-3'

Oligonucleotide 56

5'-CTT CCC TTC CTC TTT CTC TCC CTT CTC GAA TCG TAA CCG TTC GTA CGA GAA TCG CTG TCC
TCT CCT TC-3'

Oligonucleotide 57

5'-AAT TGA AGG AGA GGC AGA AGG GAG AG-3'

Alternative vectorette units

Oligonucleotide 49 should be annealed to oligonucleotide 50.
Oligonucleotides 51, 52 and 57 should be annealed to oligonucleotide 40.
Oligonucleotide 53 should be annealed to oligonucleotide 54.
Oligonucleotide 55 should be annealed to oligonucleotide 56. These oligonucleotides would be used with
genomic DNA cleaved with EcoRI.

In Table 1, N1, N2, N3 and N4 represent four nucleotides present to provide cohesive ends capable of
ligation to a corresponding restriction site on a target nucleic acid fragment. The nucleotides N5 and N5' are
deliberately selected so as to destroy the restriction site recognition pattern. Thus for example:-

N1N2N3N4N5 may represent AATTC
N5'                      C

in respect of the EcoRI restriction site, and

N1N2N3N4N5 may represent AGCTG
N5'                      C

in respect of the Hind III restriction site.

Where N1, N2, N3, N4, N5 and N5' represent the above-mentioned EcoRI restriction site recognition
pattern, the specific vectorette units 1-10 are detailed below in Table 2, each oligonucleotide being labelled
by its appropriate identification number. In vectorette units 11 and 12 of Table 2, N1, N2, N3, N4, N5 and N5'
represent the above-mentioned Hind III restriction site recognition pattern and each oligonucleotide is
labelled by its appropriate identification number.

Oligonucleotides 58, 59 and 60 may be used with all constructs.
Oligonucleotide 58 is a universal vectorette primer,

eg.

(40) 3' CTTCCTCTCCTGTCGCTAAGAGCATGCTTGCCAATGCTAAGCTCTTCCCTCTC 5'
                    ||||||||||||||||||||||||||||||
                 3' TCGCTAAGAGCATGCTTGCCAATGCTAAGC 5'
                                  58

Oligonucleotide 59 is a universal sequencing primer and a nested vectorette primer,

eg.

(40) 3' CTTCCTCTCCTGTCGCTA
         |||||||||||||||
      3' TTCCTCTCCTGTCGC 5'
               59

Oligonucleotide 60 is a nested universal vectorette primer,

eg.

(40) 3' CTTCCTCTCCTGTCGCTAAGAGCATGCTTGCCAA

     3'              60               5'

Short oligonucleotides are the 14mer + 42mer annealed.
Long oligonucleotides are the 57mer + 53mer annealed.
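The positions at which primers 58 and 59 are drawn against oligonucleotide 40 can be recomputed with a few lines of Python. This is only a sketch. The constant names are illustrative, oligonucleotide 40 is written 3' to 5' as in the diagram, and the readings of the unclear characters in that diagram are assumptions.

```python
# Oligonucleotide 40 as drawn in the diagram, i.e. written 3' -> 5'
# (assumption: unclear characters in the diagram are read as shown here).
OLIGO_40_3TO5 = (
    "CTTCCTCTCCTG"
    "TCGCTAAGAGCATGCTTGCCAATGCTAAGC"
    "TCTTCCCTCTC"
)

# Oligonucleotide 58 as listed in the text, written 5' -> 3'.
OLIGO_58_5TO3 = "CGAATCGTAACCGTTCGTACGAGAATCGCT"

# Oligonucleotide 59 as drawn in the diagram, written 3' -> 5'.
OLIGO_59_3TO5 = "TTCCTCTCCTGTCGC"


def offset_in_diagram(template_3to5: str, primer_3to5: str) -> int:
    """Index at which the primer lines up under the template, or -1."""
    return template_3to5.find(primer_3to5)


# Reversing the 5' -> 3' listing gives the 3' -> 5' form used in the diagram.
offset_58 = offset_in_diagram(OLIGO_40_3TO5, OLIGO_58_5TO3[::-1])
offset_59 = offset_in_diagram(OLIGO_40_3TO5, OLIGO_59_3TO5)

print(len(OLIGO_40_3TO5), offset_58, offset_59)
```

In the diagram each primer matches the drawn strand base for base, so a plain substring search is sufficient and no complementing is needed.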

Type A and Type D are equivalent short and long oligonucleotides respectively. Therefore 1 and 26 have
the same overhang and are both to be used with DNA cleaved with Nhe I, Spe I or Xba I. Similarly 2 and 27
are both to be used with DNA cleaved with EcoRI.
Exactly the same applies to Type B and Type E, 19 and Type F (42), and Type C and Type G.
Each oligonucleotide is to be ligated to DNA cleaved with the restriction endonuclease indicated next to that
oligonucleotide. Types A and B (and Type D and Type E) are for use with 5' overhangs.

Oligonucleotides 19 + Type C (and Type F + Type G) are for use with 3' overhangs.
Oligonucleotides 1 to 8 are paired with oligonucleotide 14.
9 to 13 and 18 are paired with oligonucleotide 17.
19 is paired with oligonucleotides 20 to 25.
26 to 33 are paired with oligonucleotide 40.
34 to 39 are paired with oligonucleotide 41.
Type F (42) is paired with Type G oligonucleotides 43 to 48.
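The pairing list above is in effect a lookup table, and a short Python sketch makes that explicit. The names PAIRINGS and partner are hypothetical. Only the pairings stated in the list immediately above are encoded, and other pairings mentioned elsewhere in the text are deliberately left out.

```python
# Each entry: (oligonucleotides that vary with the restriction site,
#              the single oligonucleotide they are all paired with).
PAIRINGS = [
    (tuple(range(1, 9)), 14),             # 1 to 8 with 14
    (tuple(range(9, 14)) + (18,), 17),    # 9 to 13 and 18 with 17
    (tuple(range(20, 26)), 19),           # 20 to 25 with 19
    (tuple(range(26, 34)), 40),           # 26 to 33 with 40
    (tuple(range(34, 40)), 41),           # 34 to 39 with 41
    (tuple(range(43, 49)), 42),           # Type G 43 to 48 with Type F (42)
]


def partner(number: int):
    """Return the oligonucleotide paired with `number`, or None if unlisted."""
    for members, shared in PAIRINGS:
        if number in members:
            return shared
    return None


for n in (1, 18, 22, 33, 39, 48, 16):
    print(n, partner(n))
```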

Oligonucleotide 7 (and 32) may be replaced by oligonucleotide 11 (and 36) for use with BspM II
restricted DNA.

It will be appreciated that oligonucleotides 1 to 7 are for annealing to oligonucleotide 14.
Oligonucleotides 8 to 10 are for annealing to oligonucleotide 15. Oligonucleotides 11 to 13 are for annealing
to oligonucleotide 17. It may not be necessary to use oligonucleotide 16.

Oligonucleotide 58

CGAATCGTAACCGTTCGTACGAGAATCGCT

(a 30 base long oligonucleotide intended as the "universal vectorette primer" in the polymerase chain
reaction (PCR) outlined in step 6 of Method I hereinafter).

Additional Oligonucleotides used in the Examples hereinafter

These oligonucleotides may be used with any vectorette library irrespective of the restriction
endonuclease employed except for oligonucleotide 65 as detailed below:-

Oligonucleotide 61

GAGACTTGGTATTTTGTTCAATCATTAAG

(a 3' oligonucleotide of exon V of the α-1 antitrypsin gene).

Oligonucleotide 62

AAAAGCCAGAGACCTCACTCCCGGGGAGCC

(a 5' oligonucleotide of exon 1 of the human phenylalanine hydroxylase gene, mutations in which cause
phenylketonuria (PKU)).

Oligonucleotide 63

AGGGACTTACTGTGGCGAGCTTTTCAATGT

(a 3' oligonucleotide of exon 9 of the phenylalanine hydroxylase (PAH) gene).

Oligonucleotide 64

GGGCCTCAGTCCCAACATGGCTAAGAGGTG

(a 3' oligonucleotide of exon III of the α-1 antitrypsin gene).

Oligonucleotide 65

AATTTCACACAGGAAACAGCTATGACCATG

(an extended M13 reverse primer).
This is to be used as one of the PCR primers for amplification of inserts in either M13 DNA or pUC plasmid
DNA.

Oligonucleotide 66

5'-atc agg tgc acc gag AGA GGC AAG GCC-3'

(a 3' oligonucleotide of exon 9 of the phenylalanine hydroxylase (PAH) gene).

Method I

Method I comprises the following six steps:-

Step 1 - Phosphorylation of oligonucleotide Type A, followed by gel purification of the phosphorylated
species.
Step 2 - Addition of a ddA residue to the 3' end of the phosphorylated oligonucleotide Type A of step
1 followed by gel purification.
Step 3 - Annealing of the modified oligonucleotide Type A to the corresponding oligonucleotide Type B.
Step 4 - DNA preparation and restriction digestion.
Step 5 - Ligation of the annealed oligonucleotides Type A and Type B to the cleaved dephosphorylated
DNA of step 4.
Step 6 - Amplification of the ligation products of step 5 and detection and analysis of the PCR
products.

Such steps may for example be effected as follows:-

Step 1

The following buffers and other standard solutions were used:-

1.1 10X kinase buffer

0.5M Tris.HCl (pH 7.6)
0.1M MgCl2
50 mM dithiothreitol
1 mM Spermidine
1 mM EDTA
stored at -20°C.

1.2 Adenosine 5'-triphosphate

Sodium salt as a 10 mM solution (Pharmacia)
Diluted to 20 pmol/ul with water,
stored at -20°C.

1.3 Radioisotopes

a) Adenosine 5'-[γ-32P]triphosphate (Amersham). Specific activity approx. 6000 Ci/mmol.

b) Dideoxyadenosine 5'-[α-32P]triphosphate (Amersham). Specific activity approx. 3000 Ci/mmol.

1.4 Formamide Sample Buffer

80% (v/v) deionised formamide
50 mM Tris.borate pH 8.3
1 mM EDTA
0.1% (w/v) Bromophenol Blue
0.1% (w/v) xylene cyanol

The formamide was deionised by stirring for 30 minutes with 2 g Amberlite MB-1 resin (Bio-Rad) and
filtering off using a Whatman number 1 filter. The buffer was stored in the dark.

1.5 40% Stock Acrylamide per 500 ml:

Acrylamide (electrophoretic grade)        190 g
Bis-acrylamide (electrophoretic grade)     10 g

The acrylamide was stirred for 30 minutes with 5 g of Amberlite MB-1 resin (Bio-Rad) and filtered using two
sheets of Whatman Number 1.
The stock solution was stored in the dark at 4°C.

1.6 10% Ammonium Persulphate

1 g of ammonium persulphate (Sigma, electrophoresis grade) was dissolved and made up to 10 ml in sterile
deionised water. It was stored in the dark.



1.7 0.5X Gel mix (20% acrylamide, 7M urea) per 50 ml:

21 g       Urea
25 ml      40% stock acrylamide (1.5)
2.5 ml     10x TBE (1.8)

Just before use add polymerising agents:

300 ul     10% ammonium persulphate (1.6)
40 ul      TEMED

1.8 10X TBE per litre:-

Tris          109 g
EDTA          9.5 g
Boric Acid    55 g

10x TBE buffer was diluted in water to 1x or 0.5x strength as required.
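The quantities in the gel mix recipe (1.7) can be checked against its stated strengths with simple dilution arithmetic. The Python sketch below is illustrative only. The helper name is hypothetical and the molar mass of urea (60.06 g/mol) is a standard value rather than a figure taken from the text.

```python
UREA_MOLAR_MASS = 60.06  # g/mol, standard value (not stated in the text)


def diluted_strength(stock_strength: float, stock_ml: float, final_ml: float) -> float:
    """Strength after diluting `stock_ml` of stock to `final_ml`."""
    return stock_strength * stock_ml / final_ml


FINAL_VOLUME_ML = 50.0

# 21 g of urea in 50 ml, expressed in mol per litre.
urea_molar = 21.0 / UREA_MOLAR_MASS / (FINAL_VOLUME_ML / 1000.0)

# 25 ml of the 40% acrylamide stock (1.5) in 50 ml.
acrylamide_percent = diluted_strength(40.0, 25.0, FINAL_VOLUME_ML)

# 2.5 ml of 10x TBE (1.8) in 50 ml.
tbe_strength = diluted_strength(10.0, 2.5, FINAL_VOLUME_ML)

print(round(urea_molar, 2), acrylamide_percent, tbe_strength)
```

The three results agree with the heading of the recipe: approximately 7M urea, 20% acrylamide and 0.5x TBE.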





1.9 Buffer A

0.1M       Tris.HCl pH 7.7
10 mM      Triethylamine
1 mM       EDTA

The buffer was stored at 4°C and made up fresh once a month.

1.10 Reagent B per 100 ml:-

50 ml      ethanol
50 ml      sterile deionised water.



Phosphorylation of the 14 mer oligonucleotide Type A

The following were mixed in a 1.5 ml Sarstedt tube:

Oligonucleotide Type A                         100 pmoles
10X kinase buffer (1.1)                        5 ul
Adenosine 5'-triphosphate (1.2)                9.5 ul (190 pmoles)
Adenosine 5'-[γ-32P]triphosphate (1.3)         5 ul (10 pmoles)
Polynucleotide kinase E. coli B (Amersham)     40 units
H2O to 50 ul.

The ratio of oligonucleotides to ATP molecules was kept at 1:2.
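The 1:2 ratio can be reproduced from the quantities in the reaction mix. The Python sketch below is a worked check rather than part of the method. The variable names are illustrative, and the 5 ul volume of 10X kinase buffer is an assumed reading of an unclear figure in the text.

```python
REACTION_VOLUME_UL = 50.0
OLIGO_PMOL = 100.0

# Unlabelled ATP (1.2): 9.5 ul of a 20 pmol/ul working solution.
cold_atp_pmol = 9.5 * 20.0
# Labelled ATP (1.3): 10 pmoles, as stated in the mix.
hot_atp_pmol = 10.0
total_atp_pmol = cold_atp_pmol + hot_atp_pmol

atp_per_oligo = total_atp_pmol / OLIGO_PMOL

# Final buffer concentrations, assuming 5 ul of 10X kinase buffer (1.1).
KINASE_BUFFER_10X_MM = {
    "Tris.HCl": 500.0,
    "MgCl2": 100.0,
    "dithiothreitol": 50.0,
    "spermidine": 1.0,
    "EDTA": 1.0,
}
BUFFER_UL = 5.0  # assumed reading of the source
final_mm = {
    name: conc * BUFFER_UL / REACTION_VOLUME_UL
    for name, conc in KINASE_BUFFER_10X_MM.items()
}

print(total_atp_pmol, atp_per_oligo)
print(final_mm)
```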

Phosphorylation of the oligonucleotides was carried out by incubation at 37°C for 60 minutes.

20 ul of the formamide sample buffer (1.4) was added and the enzyme sample was denatured in a
90°C water bath for 10 minutes prior to loading on a 20% acrylamide, 7M urea denaturing gel.

Gel purification of the kinased oligonucleotides was then performed. Phosphorylated oligonucleotides were
purified by electrophoresis on denaturing polyacrylamide gels as described by Wu R. et al., "Purification and
sequence analysis of synthetic oligonucleotides" in Oligonucleotide Synthesis - a practical approach, edited
by M J Gait, IRL Press.



c) Final purification through a column

NENsorb TM (DuPont) nucleic acid purification cartridges were employed for this final purification step.

1) Pre-equilibration of the column.

The column was clamped to a clamp stand. The inside of the cartridge was rinsed with 2 ml of HPLC
grade methanol. This washed any loose packing back onto the column bed.

The adaptor was securely pressed into the top of the cartridge forming an air tight seal. A disposable
plastic syringe was filled with air and attached to the adaptor. With constant gentle pressure all the
methanol was pushed through the cartridge (flow rate 1 drop per 2 seconds). In the same way the column
was 'primed' by flowing 2 ml of buffer A (1.9) through it.



2) Sample loading.

2.8 ul of triethylamine were added to the 2 ml of water containing the oligonucleotide. This was then
added directly to the top of the column bed using a syringe and needle. The sample was pushed through
the column using a syringe filled with air as in c(1).

3) Sample wash.

3 ml of buffer A (1.9) was added to the column using a syringe and needle. This was gently forced
through the column as in c(1).

4) 1 ml of reagent B (1.10) was applied to the top of the column. The DNA was eluted from the column
bed by gentle force using a syringe filled with air as in c(1). The effluent was collected in a 1.5 ml Sarstedt
tube and evaporated to dryness in a centrifuge to which a vacuum is applied (UNIVAP, Uniscience).

The dried, purified oligonucleotide was resuspended as required.

STEP 2

The following buffers and other standard solutions were used:-

2.1

1.4M Sodium Cacodylate
pH 7.5 with HCl
Stored at -20°C.

2.2

5 mM cobaltous chloride
Stored at -20°C.

2.3

1 mM dithiothreitol
Stored at -20°C.

2.4

Dideoxyadenosine 5'-triphosphate, lyophilised sodium salt
(Boehringer Mannheim)
Diluted to 10 mM stock solution in H2O
and 1 mM working solution in H2O

2.5

0.5 M EDTA pH 8.0

Addition of a dideoxyadenosine residue to the 3' end of the oligonucleotide Type A.

Oligonucleotide Type A was phosphorylated and the phosphorylated species purified as described in
step 1. With 80% recovery, there are 80 pmoles of oligonucleotide kinased and available for the terminal
transferase reaction.

The 80 pmoles of kinased and purified oligonucleotide I were resuspended in 48 ul of sterile deionised
water.

The following were added to the kinased oligonucleotide.



(48 ul     oligonucleotide (80 pmol))
1 ul       1 mM ddATP (1 nmol) (2.4)
10 ul      1.4M Cacodylic acid (2.1)
10 ul      1 mM DTT (2.3)
20 ul      5 mM Cobaltous chloride (2.2)
5 ul       ddATP (α-32P) (15 pmol) (1.3b)
5 ul       Terminal Transferase (150 units of Boehringer TdT)



The reaction mixture was incubated at 37°C for 1 hr and the reaction was then terminated by the
addition of 4 ul of 0.5M EDTA.

40 ul of the formamide sample buffer (1.4) was added and the sample denatured by incubating in a
90°C water bath for 10 minutes prior to loading on a 20% acrylamide, 7M urea denaturing gel.
Gel electrophoresis was exactly as described in step 1.
The now 15 mer oligonucleotide Type A band was cut out and purified exactly as described in step 1.
The final recovery is estimated to be 50 pmoles. This is resuspended in 250 ul of sterile deionised water to
give a concentration of 0.2 pmol/ul.



Step 3

Annealing of the modified oligonucleotide Type A to the corresponding oligonucleotide Type B

The following solution was required.

3.1 10X Annealing buffer.

100 mM     Tris HCl (pH 8.0)
100 mM     MgCl2

The following were mixed in a 1.5 ml screw cap Sarstedt tube:-

25 pmoles of modified Type A oligonucleotide (obtained via steps 1 and 2)
25 pmoles of Type B oligonucleotide
5 ul 10X annealing buffer (3.1)
and H2O to 50 ul.

If the Type A oligonucleotide is selected from oligonucleotides 1, 2, 3 or 4 then a convenient Type B
oligonucleotide for these (1-4) will be oligonucleotide 14.

The Eppendorf tube was placed in a boiling water bath (conveniently a small beaker). After 5 minutes,
the water in the beaker was allowed to cool slowly to room temperature over a period of 2 hours. Finally the
contents of the tube were collected at the bottom by centrifugation for 10 seconds. The annealed
oligonucleotide, at a concentration of 0.5 pmol/ul, is ready for use in ligations.
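The stated concentration follows directly from the annealing mix, and the same figure gives the volume of annealed oligonucleotide to pipette for a later ligation. The Python sketch below is illustrative. The function name is hypothetical and the 11 pmole amount is the quantity called for in the ligation of Step 5 of Method I.

```python
ANNEALED_PMOL = 25.0      # 25 pmoles of each strand give 25 pmoles of duplex
ANNEALING_VOLUME_UL = 50.0

annealed_pmol_per_ul = ANNEALED_PMOL / ANNEALING_VOLUME_UL


def volume_for(pmol_needed: float, pmol_per_ul: float) -> float:
    """Volume in ul that contains `pmol_needed` at the given concentration."""
    return pmol_needed / pmol_per_ul


# Step 5 of Method I calls for 11 pmoles of annealed oligonucleotides.
ligation_volume_ul = volume_for(11.0, annealed_pmol_per_ul)

print(annealed_pmol_per_ul, ligation_volume_ul)
```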
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Step 4 



DNA preparation and restriction digestion

The following DNA samples were used:-
1. Human Genomic DNA I

S " 'k*""""*' ^^'^ ^'^^ '"^ lymphoblastoid cell,) to which 0.. mg..ml o. 

The following buffers were used: 



4.1 10x Low salt buffer

100 mM     Tris-HCl pH 7.5
100 mM     MgCl2
10 mM      Dithioerythritol

4.2 10x Medium salt buffer

100 mM     Tris-HCl pH 7.5
100 mM
500 mM     NaCl
10 mM      Dithioerythritol

4.3 10x High salt buffer

500 mM     Tris-HCl pH 7.5
100 mM     MgCl2
1000 mM    NaCl
10 mM      Dithioerythritol

4.4 10X BamHI buffer

100 mM     Tris-HCl pH 8.0
50 mM      MgCl2
1000 mM    NaCl
10 mM      2-mercaptoethanol.

4.5 10x EcoRI buffer

100 mM     Tris-HCl pH 7.5
100 mM     MgCl2
500 mM     NaCl
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4.6 TE buffer 


10 mM      Tris-HCl pH 7.5
1 mM       EDTA

4.7 Agarose Sample Buffer

15% (w/v)      Ficoll 400
0.05% (w/v)    Bromophenol Blue
0.05% (w/v)    Xylene Cyanol

The sample buffer was made up in 1X TBE buffer (1.8).

4.8 10X CIP Buffer

500 mM     Tris-HCl pH 8.0
1 mM       EDTA



(i) The following conditions were employed for cleaving the genomic DNA with specific restriction
endonucleases:-

50 ug of DNA

20 ul of the appropriate 10X restriction buffer.

50 units of the desired restriction endonuclease.

The final volume of the reaction mixture was adjusted to 500 ul.

Digestion was effected at 37°C for 2 hrs. The reaction was terminated by the addition of 4 ul of 0.5M
EDTA (2.5) and heating the sample to 70°C for 10 minutes.

(ii) 1 ug of the sample was checked by agarose gel electrophoresis to ensure complete digestion:-



Agarose gel electrophoresis

The gels were made 0.7% (w/v) agarose (Pharmacia) in 1X TBE (1.8) buffer and submerged in the
same strength TBE buffer for electrophoresis. A fifth-volume of agarose sample buffer (4.7) was added to
the sample prior to loading into the wells of the gel. Following electrophoresis the gel was stained for 10
minutes by submerging in deionised water containing 1 ug/ml ethidium bromide. The DNA was visualised
at 302 nm using a transilluminator (Macrovue-LKB).

(iii)(a) The digested DNA was purified by phenol extraction:

An equal volume of buffered redistilled phenol (BRL nucleic acid grade phenol buffered with TE (4.6))
was added to the restricted DNA and the mixture vortexed for 30 seconds. The white emulsion formed was
separated into two phases by centrifugation for 5 minutes. The aqueous phase was transferred to a fresh
tube. Any residual phenol was removed by extracting the aqueous layer first with an equal volume of
phenol:chloroform:isoamyl alcohol (25:24:1) and then with an equal volume of chloroform:isoamyl alcohol (24:1).

(b) The DNA was finally purified by ethanol precipitation:

A tenth-volume of 3M sodium acetate was added to the DNA solution to give a final concentration of
300 mM sodium acetate. The DNA was precipitated by the addition of 2.5 volumes of 95% ethanol and then
standing at -80°C for 30 minutes followed by centrifugation (microcentrifuge for 5 minutes, 12,000 rpm). The
pelleted DNA was washed with 80% (v/v) cold ethanol and dried under vacuum. The DNA was dissolved in
buffer as required.



(iv) Dephosphorylation of the cleaved DNA

The cleaved DNA was dephosphorylated to prevent self ligation.
The restricted and purified DNA was dissolved in 175 ul sterile deionised water.
To this DNA were added:

5 ul       of Calf intestine phosphatase (CIP) (Boehringer Mannheim) (1 unit/ul)
20 ul      of 10X CIP buffer (4.8)

The sample was incubated at 37°C for 60 minutes. 4 ul of 0.5M EDTA was added and the
dephosphorylated DNA was purified by phenol extraction and ethanol precipitation as described in (iii)(a)
and (iii)(b) above. The final DNA pellet was dissolved in 50 ul of TE buffer (4.6) to give a concentration of 1
ug/ul.



Step 5

Ligation of the annealed oligonucleotides I & II to the cleaved dephosphorylated DNA

The following buffer was used

5.1 10X ligation buffer

200 mM     Tris-HCl pH 7.4
100 mM     MgCl2
100 mM     DTT
10 mM      ATP

The following were mixed in a 1.5 ml Sarstedt tube

Cleaved genomic DNA (7.4 pmole)              10 ug
Annealed oligonucleotides from step 3        11 pmole
10X ligation buffer                          10 ul
T4 DNA ligase (Boehringer Mannheim)          2 units
Sterilised deionised water to bring volume to 100 ul



The ligation mix was left at 4°C overnight. Efficiency of ligation was estimated by gel electrophoresis of
5% of the ligation products - see Step 4 (ii) for procedure details.

The ligation products were then purified by phenol extraction and ethanol precipitation as described in
Step 4(iii). The final DNA pellet was dissolved in 9.5 ul of TE buffer (4.6) to give a concentration of 1 ug/ul.
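The figure of 7.4 pmole of ends in 10 ug of cleaved genomic DNA is not derived in this passage. One set of assumptions that reproduces it is sketched below in Python. The assumptions, which are not taken from the text, are an enzyme with a six base recognition site cutting random sequence once every 4^6 = 4096 bp on average, a mass of about 660 g/mol per base pair, and two ends per fragment. The function name is hypothetical.

```python
def sticky_end_pmol(mass_ug: float, site_length: int = 6,
                    grams_per_mol_per_bp: float = 660.0) -> float:
    """Estimate pmoles of fragment ends in `mass_ug` of fully digested DNA.

    Assumptions (not from the source text): random sequence, so the mean
    fragment length is 4 ** site_length bp; about 660 g/mol per base pair;
    two ends per fragment.
    """
    mean_fragment_bp = 4 ** site_length
    fragment_molar_mass = mean_fragment_bp * grams_per_mol_per_bp
    # ug -> g is 1e-6 and mol -> pmol is 1e12, giving a net factor of 1e6.
    fragment_pmol = mass_ug * 1e6 / fragment_molar_mass
    return 2.0 * fragment_pmol


ends_in_10_ug = sticky_end_pmol(10.0)
oligo_excess = 11.0 / ends_in_10_ug  # 11 pmole of annealed oligonucleotides

# 5% of the 10 ug is used for the gel check, leaving 9.5 ug for 9.5 ul of TE.
remaining_ug = 10.0 * (1.0 - 0.05)

print(round(ends_in_10_ug, 1), round(oligo_excess, 2), remaining_ug)
```

Under these assumptions the annealed oligonucleotides are in roughly 1.5-fold molar excess over DNA ends, and the 9.5 ug left after the gel check is consistent with the stated final concentration of 1 ug/ul.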

Step 6

Amplification of the ligation products of Step 5

The following buffers and other standard solutions were used


EP 0 356 021 A2 



6.1 10X Amplification buffer

500 mM     KCl
100 mM     Tris-HCl pH 8.3
15 mM      MgCl2
0.1%       gelatin

Stored at -20°C.

6.2 Nucleotides

10X stock

2 mM       dATP
2 mM       dTTP
2 mM       dGTP
2 mM       dCTP

Stored at -20°C.

6.3 Size markers

Bacteriophage φX174 cleaved with Hae III;
Size of bands in base pairs is:-
1353, 1078, 872, 603, 310, 281, 271, 234, 194, 118 and 72.

The following were mixed in a 1.5 ml screw cap Sarstedt tube:-
1 ug of ligation products from step 5.
100 pmoles of oligonucleotide 58
100 pmoles of oligonucleotide 61 or 62 or 63
10 ul 10X Amplification buffer (6.1)
10 ul 10X Stock nucleotides.

The volume was made up to 100 ul with distilled deionised water.

The tube was placed in a boiling water bath for 5 minutes. Taq polymerase (Anglian) was diluted to 1
unit/ul in 1X Amplification buffer (6.1).

2 ul of the diluted enzyme were added to the above boiled reaction mix, and mixed. The contents were
spun to the bottom in a microcentrifuge for 5 seconds. 50 ul of light mineral oil (Sigma) was then placed on
the reaction mixture (to prevent evaporation during the following reactions).

The tube was placed in a water bath at

1) 60°C for 2 minutes

2) 72°C for 3 minutes

3) 91°C for 2 minutes

Steps 1), 2) and 3) were then repeated 39 times. The tube was then placed in
the water bath at 60°C for 2 minutes followed by incubation for 3 minutes at 72°C. At the end of the
amplification the contents were spun to the bottom and the mineral oil pipetted off.
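Written out as a schedule, the water bath incubations amount to 40 cycles (the first pass plus 39 repeats) followed by a final annealing and extension. The Python sketch below builds that schedule and totals the incubation time. The names CYCLE and build_program are illustrative, and time spent moving the tube between baths is ignored.

```python
# One cycle as (temperature in degrees C, minutes), in the order given.
CYCLE = [(60, 2), (72, 3), (91, 2)]

# Final incubations after the last cycle.
FINAL = [(60, 2), (72, 3)]


def build_program(repeats_after_first: int = 39):
    """Return the full list of (temperature, minutes) incubations."""
    program = []
    for _ in range(1 + repeats_after_first):
        program.extend(CYCLE)
    program.extend(FINAL)
    return program


program = build_program()
total_cycles = (len(program) - len(FINAL)) // len(CYCLE)
total_minutes = sum(minutes for _, minutes in program)

print(total_cycles, len(program), total_minutes)
```

The total comes to 285 minutes, or 4 hours 45 minutes of incubation.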

15 ul of the amplified products were analysed by agarose gel electrophoresis as described in Step 4(ii).
To estimate the size of amplified product(s), φX174 cleaved with Hae III (6.3) was electrophoresed adjacent
to the amplified sample.



METHOD II

Method II comprises the following steps:-

1. Adding a ddA residue to the 3' end of oligonucleotide Type A followed by gel purification of the 3'
end labelled species.

2. Phosphorylating the 3' end labelled oligonucleotide Type A followed by separation from
unincorporated nucleotides.

3. Annealing of the modified oligonucleotide Type A to the corresponding oligonucleotide Type B.

4. Cleavage of the genomic DNA.

5. Ligating the annealed oligonucleotides A + B to the cleaved DNA.

6. Amplification of the ligation products. Detection and analysis of the PCR products.

The above steps may for example be effected as follows:-



Step 1

Experimental details are exactly the same as step 2 of Method I.

Step 2

The oligonucleotide is phosphorylated as described in step 1 of Method I. The phosphorylated
oligonucleotide is purified directly through a column as detailed in Step 1 (ii)C of Method I.
Prior to loading the sample on the column, 200 ul of buffer A (1.9) is added to the reaction mix in place of
triethylamine.

Steps 3-6 are exactly the same as in Method I.



METHOD III

The following steps are involved:-

1. Phosphorylating the 14 mer oligonucleotide Type A followed by gel purification of the
phosphorylated species.

2. Adding a ddA residue to the 3' end of the kinased oligonucleotide I followed by gel purification.

3. Annealing of the oligonucleotide Type A to the corresponding oligonucleotide Type B.

4. Restriction of the DNA.

5. Ligating the annealed oligonucleotides A + B to the restricted DNA.

6. Addition of a dideoxynucleotide to free 3' ends.

7. Amplification of the ligation products. Detection and analysis of the amplification products.

Such steps may for example be effected as follows:-

Steps 1 to 5 are exactly as detailed in METHOD I.



Step 6

The purified ligation products from Step 5 are dissolved in 30 ul of distilled deionised water. To the
DNA are added the following:-

5 ul of 10x amplification buffer (6.1)
15 pmoles of the appropriate dideoxynucleotide
1 unit of Taq polymerase (Anglian)
water added to bring volume to 50 ul.

The reaction mix is incubated at 37°C for 30 minutes.

The DNA is purified by phenol extraction and ethanol precipitation as described in Method I Step 4 (iii).
The DNA pellet is dissolved in 50 ul of TE buffer (4.6) to give a concentration of 1 ug/ul.
Step 7 is the same as Step 6 of METHOD I.



METHOD IV

Exactly the same as METHOD II except that Step 6 of METHOD III is inserted between steps 5 and 6 of
METHOD II.
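Because Method IV is defined only by reference to Methods II and III, it can help to write the resulting step order out explicitly. The Python sketch below does so with plain lists. The step descriptions are paraphrased from the listings for Methods II and III, and the variable names are illustrative.

```python
# Steps of Method II, paraphrased from its step listing.
METHOD_II = [
    "Add a ddA residue to the 3' end of Type A; gel purify",
    "Phosphorylate the 3' end labelled Type A; remove unincorporated nucleotides",
    "Anneal modified Type A to the corresponding Type B",
    "Cleave the genomic DNA",
    "Ligate the annealed oligonucleotides A + B to the cleaved DNA",
    "Amplify the ligation products; detect and analyse the PCR products",
]

# Step 6 of Method III.
METHOD_III_STEP_6 = "Add a dideoxynucleotide to free 3' ends"

# Method IV: Method II with the Method III step between its steps 5 and 6.
METHOD_IV = METHOD_II[:5] + [METHOD_III_STEP_6] + METHOD_II[5:]

for number, step in enumerate(METHOD_IV, start=1):
    print(number, step)
```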
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METHOD V 

Method V comprises the following steps:-

1. Phosphorylating the 57-mer Type D oligonucleotide followed by its separation from unincorporated
nucleotide.

2. Annealing of the phosphorylated, for example kinased, Type D oligonucleotide to the appropriate
Type E oligonucleotide, followed by Step 4 onwards of Methods I-IV.

Such steps may for example be effected as follows:-

1. The oligonucleotide is phosphorylated as described in Method I step 1. Purification is through a
NENsorb TM column as described in Method I Step 1(ii)C.

2. Annealing is as detailed in Method 1. Step 4 onwards are as detailed previously (METHODS I and 



Method VI

Method VI comprises the following steps:-

1. Phosphorylating either the Type A 14 mer oligonucleotide (or oligonucleotides 18 or 19 or 39 or
Type F (42) or Type D) followed by gel purification of the phosphorylated species.

2. Annealing of the phosphorylated oligonucleotide (or 18, 19 or Type D) to the corresponding
appropriate oligonucleotide Type B (or Type C or Type E or Type G).

3. Restriction enzyme digestion of the genomic DNA.

4. Ligation of the annealed oligonucleotides to the restricted DNA.

5. Restriction of ligation products from above step.

6. Ligation of more annealed oligonucleotides to products from step (5).

7. Repeat of steps (5) and (6).

8. Final restriction of products from step (7).

9. Addition of a dideoxynucleotide triphosphate to the 3' end of the products from step (8).

10. Amplification of the products from step (9). Detection and analysis of the amplified products.

Such steps may for example be effected as follows:-



Step 1

As described in step 1 of method I.

Step 2

This step is the same as described in step 3 of method I.
For example:-

Oligonucleotides 1 to 7 are annealed to 14
Oligonucleotides 11 to 13 and 18 are annealed to 17
Oligonucleotides 20 to 25 are annealed to 19
Oligonucleotides 26 to 32 are annealed to 40
Oligonucleotides 34 to 39 are annealed to 41
Oligonucleotides 43 to 48 are annealed to 42

Step 3

This step is the same as step 4(i), 4(ii) and 4(iii) of method I but restricted DNA is not dephosphorylated.

Step 4

Ligating the annealed oligonucleotides to the cleaved DNA

Buffers and other standard solutions:

4.1 10x ligation buffer

200 mM     Tris-HCl pH 7.4
100 mM     MgCl2
100 mM     DTT
10 mM      ATP



(i) The calculation of pmoles of sticky ends in 1 ug of cleaved DNA is as set out in step 5 of method I.

(ii) Ligations

The following were mixed in a 1.5 ml Sarstedt tube:

Cleaved genomic DNA                            7.4 pmoles (10 ug)
Annealed oligonucleotides from step 2          11 pmoles
10x ligation buffer (4.1)
T4 DNA ligase (Boehringer Mannheim)            2 units
Sterilised deionised water to bring volume to  100 ul.

The ligation mix was left at 4°C overnight.

Efficiency of ligation was estimated by gel electrophoresis of 5% of the ligation products - see step 4(ii) of
method I for the procedure details.

The ligation products were purified by phenol extraction and ethanol precipitation as described in
section 4(iii) of method I.



Step 5 

This step is the same as step 3 above. 
Step 6

This step is the same as step 4 above except equimolar concentrations of DNA to annealed
oligonucleotides are used.
For example
7.4 pmoles of DNA
7.4 pmoles of annealed oligonucleotides

Step 7

Repeat of steps 5 and 6 above.

Step 8

As step 5 above.

Step 9
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Buffers and other standard solutions 



9.1 T7 DNA polymerase buffer (5x)

200 mM     Tris.HCl pH 7.5
100 mM     MgCl2
250 mM     NaCl

9.2 dNTP-dATP

1 mM       dTTP
1 mM       dGTP
1 mM       dCTP

made up in sterilised deionised water.

9.3 ddATP

1 mM       ddATP

made up in sterilised deionised water.



(i) Addition of a ddA residue to 3' ends of products from step 8:-

The following were mixed in a 1.5 ml Sarstedt tube:-
4 ug of DNA from step 8 above
20 ul of 5x T7 DNA polymerase buffer (9.1)
20 ul of dNTP-dATP (9.2)
20 ul of ddATP (9.3)
5 units of T7 DNA polymerase (Sequenase)
Volume made up to 100 ul with distilled deionised water.

The tube was incubated at 37°C for 30 minutes. The enzyme was then denatured by incubation at 70°C
for 10 minutes.

The DNA was then purified by phenol extraction and ethanol precipitation as described previously.



Step 10

This step is the same as step 6 of method I or the amplification may be carried out on a temperature
cycling machine (for example the Techne programmable Dri-Block PHC-1).

It will be appreciated that in this method VI dideoxyadenosine-5'-triphosphate is reacted with the 3'
ends of the ligated oligonucleotides in the presence of T7 DNA polymerase (Sequenase) in order to replace
any loss of 3' terminal ddA residues as a consequence of an unexpected exonuclease activity of the T4
DNA ligase specific for dideoxynucleotides.

Method VII

Method VII is the same as Method VI with the exception that step 1 of Method VI is omitted. Further, in
this method no gel purification is effected, but the efficiency of ligation may be checked by agarose gel
electrophoresis and ethidium bromide staining.

Method VIII

Method VIII is the same as Method VI with the exception that step 9 is omitted. The oligonucleotides
used in this method are for example as follows:-
26 to 32 annealed to 40
36 to 39 annealed to 41
43 to 46 annealed to 42



Method IX

Method IX comprises the following steps:-

1. Phosphorylation

2. Annealing

3. Restriction of DNA

4. Ligation (for example at 25°C) of annealed oligonucleotides to cleaved DNA

5. Restriction/ligation

6. Amplification of the products from step 5 and detection and analysis of the amplification products.

The above steps may for example be effected as described in relation to the corresponding steps in
Method VI.



Method X

Steps 1, 2 and 3 of Method X are the same as steps 1, 2 and 3 of method VIII. Step 4(i) of method X is
also the same as step 4(i) of method VIII but step 4(ii) is effected as follows:-

4(ii) Ligations

The following were mixed in a 0.5 ml Sarstedt tube:-

Restricted 7387 DNA                        0.74 pmoles (1 ug)
Annealed oligonucleotides                  1.65 pmoles
10x ligation buffer (4.1)                  1 ul
10x ATP (4.2)                              1 ul
T4 DNA ligase (Boehringer Mannheim)        1 unit
H2O to                                     10 ul

The ligation mix was left at 25°C (in the Techne Dri-Block) for 120 minutes.

The ligation mix was then diluted in water to a DNA concentration of 10 ng/ul and subjected to the
amplification reaction as described in Step 10 of Method VIII hereinbefore.
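The dilution to 10 ng/ul is a one-line calculation, sketched here in Python. The function name is hypothetical, and the sketch assumes that all of the input DNA (1 ug) is still present in the ligation mix.

```python
def water_to_add_ul(dna_ng: float, current_volume_ul: float,
                    target_ng_per_ul: float) -> float:
    """Volume of water (ul) needed to reach the target DNA concentration."""
    final_volume_ul = dna_ng / target_ng_per_ul
    return final_volume_ul - current_volume_ul


# Method X: 1 ug (1000 ng) of DNA in a 10 ul ligation mix.
water_for_10_ul_mix = water_to_add_ul(1000.0, 10.0, 10.0)

# The same helper applies when the mix has first been taken to 20 ul.
water_for_20_ul_mix = water_to_add_ul(1000.0, 20.0, 10.0)

print(water_for_10_ul_mix, water_for_20_ul_mix)
```

In both cases the final volume is 100 ul, so 90 ul of water is added to a 10 ul mix and 80 ul to a 20 ul mix.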



Method XI

Method XI is the same as method X except as follows:-

Following the ligation reaction at 25°C for 120 minutes the ligation products are digested with the
restriction endonuclease by the addition of:-
2 ul of buffer for the appropriate restriction endonuclease (see 3.1)
2 units of restriction endonuclease
H2O to 20 ul
and incubation at 37°C for 60 minutes.
The sample was then diluted in H2O to a DNA concentration of 10 ng/ul.

Method XH 

to Method XH is the same as Method X except that in step 4(ii). the ligation mix is left at 4»C overnight 
rather than at 25**C for 2 hours. 

Finally the ligation mix is diluted in water to a ONA concentration of lOngul. 

Method XIII

Method XIII is the same as Method XII except that following the overnight ligation reaction at 4°C, the
ligation products are digested with the appropriate restriction endonuclease.
This is done by:
adding to the 10ul ligation mix

2ul buffer for the appropriate endonuclease (see 3.1)
2 units of restriction endonuclease (Boehringer Mannheim)
H2O to 20ul

and incubation at 37°C for 60 minutes.
The sample is then diluted in H2O to a DNA concentration of 10ng/ul.
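
The final dilution in Methods X to XIII follows from the mass of DNA in the ligation mix and the volume it occupies. The following is a minimal Python sketch of that arithmetic, assuming the 1ug of restricted DNA listed for step 4(ii); the function name `dilution_to_target` is illustrative and does not come from the specification.

```python
def dilution_to_target(dna_ng, volume_ul, target_ng_per_ul=10.0):
    """Return (fold_dilution, water_to_add_ul) needed to reach the target concentration."""
    current = dna_ng / volume_ul
    if current < target_ng_per_ul:
        raise ValueError("sample is already more dilute than the target")
    fold = current / target_ng_per_ul
    final_volume_ul = dna_ng / target_ng_per_ul
    return fold, final_volume_ul - volume_ul


# Methods X and XII: 1ug (1000ng) of DNA in the 10ul ligation mix.
print(dilution_to_target(1000, 10))  # (10.0, 90.0)

# Methods XI and XIII: the same DNA after the digestion step, now in 20ul.
print(dilution_to_target(1000, 20))  # (5.0, 80.0)
```

Under these assumptions the undigested mixes need a ten-fold dilution and the digested mixes a five-fold dilution to reach 10ng/ul.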

Example 1 

Oligonucleotides 1 and 14 were prepared as described in Steps 1 and 2 of Method I above. These
oligonucleotides for use in Method I were annealed as described in Step 3 above. Human genomic DNA II
was prepared and cleaved as described in Step 4 of Method I. Human genomic DNA II cleaved with
restriction endonuclease Xba I and dephosphorylated (as detailed in Step 4 above) was ligated with the
annealed oligonucleotides 1 and 14, exactly as described in Step (5) above.

Amplification was then performed on the ligation products.

Figure 11(i) as hereinbefore described shows the marker φX174 cleaved with Hae III in lanes 1 and 11,
the marker λ Hind III in lane 12, an amplification control in lane 2 and the amplified fragment obtained using
oligonucleotides 58 and 61. This amplified fragment is 800 bp in length as expected.

Figure 11(ii) also shows the marker λ Hind III in lanes 1 and 13, the marker φX174 cleaved with Hae III
in lanes 2 and 12, an amplification control in lane 3 and the amplified fragment obtained using
oligonucleotides 58 and 61 in lane 10.



Example 2 

Using oligonucleotides 1 and 14 with DNA cleaved with the restriction endonuclease Xba I.

a) Oligonucleotide 1 (100 pmoles) was phosphorylated and the kinased species purified as described
in Step 1 of Method I.

b) The kinased oligonucleotide 1 was terminal transferased to add a dideoxyadenosine residue to the
3' end of the oligonucleotide. The resulting 15-mer oligonucleotide 1 (+ddA at 3' end) was purified as
detailed in Step 2 of Method I.

c) Oligonucleotide 1 was hybridised to oligonucleotide 14.

d) Human genomic DNA II was cleaved with the restriction endonuclease Xba I and dephosphorylated
as detailed in Method I.

e) The annealed oligos 1 and 14 were ligated to human genomic DNA II restricted with Xba I (see
Method I for details).

f) Amplification was carried out on the ligation products using oligonucleotides 58 and 61. The
expected amplified product in this case is approximately 800 bp.
Example 3

Using oligonucleotides 2 and 14 with DNA cleaved with the restriction endonuclease EcoRI.

a) Oligonucleotide 2 was phosphorylated and a ddA residue added at the 3' end as described
above in Example 2. This modified oligonucleotide was then annealed to oligonucleotide 14.

b) Genomic DNA I was cleaved with the restriction endonuclease EcoRI and dephosphorylated as
described in Method I.

c) The annealed oligonucleotides 2 + 14 were then ligated to genomic DNA I cleaved with EcoRI.

d) Amplification was carried out on the ligation products using:

(i) Oligonucleotides 58 and 62. The expected product is about 220 bp.
(ii) Oligonucleotides 58 and 63. The expected product is 560 bp.

Example 4 



Using oligonucleotides 3 and 14 with DNA cleaved with the restriction endonuclease BamHI.

a) Oligonucleotide 3 was phosphorylated and a ddA nucleotide added to the 3' end as described
above in Example 1.

b) This prepared oligo was then annealed to oligonucleotide 14.

c) Genomic DNA I was cleaved with the restriction endonuclease BamHI and dephosphorylated as
described in Method I.

d) The annealed oligonucleotides 3 and 14 were then ligated to genomic DNA I cleaved with BamHI
(see Method I for details).

e) Any free 3' ends in the ligation products were then blocked with a ddG residue using the procedure
described in step 6 of Method 3.

f) Finally amplification was carried out on these products using oligonucleotides 58 and 63. The
product expected is about 200 bp.



Example 5

Using oligonucleotides 3 and 14 with DNA cleaved with the restriction endonuclease BamHI.

a) Oligonucleotide 3 was prepared and annealed to oligonucleotide 14 as described above in
Example 4.

b) pUC8 containing a 440bp insert (see 83, Figure 10) plasmid DNA was digested to completion with
the restriction endonuclease BamHI and dephosphorylated as detailed in Method I.

c) Oligonucleotides 3 and 14 were annealed to 83 plasmid DNA cleaved with BamHI.

d) A ddG residue was attached to 3' free ends as detailed in Example 4. Amplification was carried out
on the ligation products using oligonucleotides 58 and 65. Figure 11(iii) hereinbefore described shows the
marker λ Hind III in lanes 1 and 11, the marker φX174 cleaved with Hae III in lane 6 and the amplified
fragment obtained according to Example 5 in lanes 2, 3, 4 and 5. As expected the product is about 440 bp
in length.



Example 6

Using oligonucleotides 26 and 40 with DNA cleaved with the restriction endonuclease Xba I.

a) Oligonucleotide 26 was phosphorylated and the kinased species purified as described in Method I.

b) The kinased oligo 26 was annealed to oligonucleotide 40 under conditions described in Step 3 of
Method I.

c) Human genomic DNA II was restricted with Xba I and dephosphorylated as detailed in Step 4 of
Method I.

d) The annealed oligos 26 and 40 were then ligated to genomic DNA II cleaved with the restriction
endonuclease Xba I (see step 5 of Method I for details).

e) Any free 3' ends in the ligation products were blocked with a ddA residue using the procedure
described in step 6 of Method III.

f) Amplification was carried out on the ligation products with:

EP 0 356 021 A2

(i) Oligonucleotides 58 and 61. The expected product is about 800 bp.

(ii) Oligonucleotides 58 and 64. The expected product is about 760 bp.



Example 7

Using oligonucleotides 27 and 40 with DNA cleaved with the restriction endonuclease EcoRI.

a) Oligonucleotide 27 was phosphorylated, purified and annealed to oligo 40 as described above in
Example 5.

b) Genomic DNA I was cleaved with the restriction endonuclease EcoRI and dephosphorylated as
detailed in step 4 of Method I.

c) The annealed oligos 27 and 40 were then ligated to genomic DNA I cleaved with EcoRI (see step
5 of Method I for details).

d) Any free 3' ends in the ligation products were blocked with a ddA residue as described in step 6 of
Method III.

e) Amplification was carried out on the ligation products using:

(i) Oligonucleotides 58 and 62. The expected product is about 220 bp.
(ii) Oligonucleotides 58 and 63. The expected product is 560 bp.

Figure 11(iii) as hereinbefore described shows the marker λ Hind III in lanes 1 and 11, the marker
φX174 cleaved with Hae III in lane 6 and the amplified fragments obtained according to Example 7 in lanes
7, 8, 9 and 10.



Example 8 

An EcoRI vectorette library was constructed according to Method VI and vectorette unit 6
(oligonucleotides 27 and 40) was used together with 7387 cell line DNA.

The amplification reaction was carried out on 10 ng of the constructed library in order to amplify the
sequence depicted in Figure 12.

Amplification conditions

The following were mixed in a 0.5 ml Sarstedt tube.
10 ng of constructed EcoRI vectorette library
100 pmole 58
100 pmole 63
100uM (final concentration) dNTPs (see M1.7 as defined hereinafter)

The volume was adjusted to 100ul with sterile double distilled H2O. 50ul of light mineral oil (Sigma)
was gently layered over the reaction mix in order to prevent evaporation. The tube was then placed in a
Techne Dri-Block set at 96°C. The DNA was allowed to denature for 10 minutes at 96°C. The block was
then allowed to cool to 90°C. 2 units of Taq polymerase (Cetus) were added to the reaction mix. The
following temperatures and times were employed to amplify the sequence between oligonucleotides 58 and
63.

91°C 2 minutes
62°C 2 minutes
72°C 2 minutes

40 cycles were performed. On the final cycle the temperature of the block was maintained at 72°C for
9.9 minutes then allowed to cool to room temperature over a period of 1 hour.
15ul of the amplification reaction mixture was analysed by agarose gel electrophoresis.

The result is shown in Figure 13 which shows the amplification product obtained according to this Example
in lane 1 and the marker φX174 cleaved with Hae III in lane 2. The amplification product is of the expected
size (ca 600 bp).
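
The cycling conditions of Example 8 can be written down as a small data structure, which makes it easy to check the total programmed time. The Python sketch below is illustrative only: the names are hypothetical, ramp times and the pause for adding enzyme at 90°C are ignored, and it assumes that on the final cycle the 9.9 minute hold at 72°C replaces the usual 2 minute step.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float
    minutes: float


INITIAL_DENATURATION = Step(96, 10)               # before enzyme addition
CYCLE = (Step(91, 2), Step(62, 2), Step(72, 2))   # denature, anneal, extend
N_CYCLES = 40
FINAL_EXTENSION_MIN = 9.9                          # 72°C hold on the last cycle (assumed to replace the 2 min step)


def total_minutes():
    """Programmed incubation time in minutes, excluding ramping and the 1 hour cool-down."""
    per_cycle = sum(step.minutes for step in CYCLE)
    last_cycle = per_cycle - CYCLE[-1].minutes + FINAL_EXTENSION_MIN
    return INITIAL_DENATURATION.minutes + (N_CYCLES - 1) * per_cycle + last_cycle


print(f"{total_minutes():.1f} minutes")  # 257.9 minutes
```

On these assumptions the block is occupied for a little over four hours before the final cool-down.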


Example 9 

This example was effected according to Method IX as follows:-

Step 1

Phosphorylation of the 'top' oligonucleotide

Oligonucleotides 49, 51, 52, 53, 55, 27, 57 and 2.

Buffers and other standard solutions



1.1 10x kinase buffer

    0.5M      Tris.HCl (pH 7.6)
    0.1M      MgCl2
    50mM      Dithiothreitol
              Spermidine
              EDTA

    Stored at -20°C



1.2 Adenosine 5'-triphosphate

Sodium salt bought from Pharmacia as a 10mM solution.
Diluted to 20pmole/ul in water
Stored at -20°C

Procedure

The following were mixed in a 1.5ml Sarstedt tube:



Oligonucleotide                        10 pmoles
10x kinase buffer (1.1)                1ul
ATP (1.2)                              1ul (20 pmoles)
T4 polynucleotide kinase (Amersham)    5 units
H2O                                    to 10ul



The molar ratio of oligonucleotides to ATP was maintained at 1:2.
Phosphorylation of the oligonucleotides was carried out by incubation at 37°C for 60 minutes.
At the end of this period the T4 polynucleotide kinase was inactivated by incubation at 90°C for 10
minutes. The tubes were then spun in a microcentrifuge for 10 seconds to collect all the condensate.
The phosphorylated species are not gel purified in this method.

Step 2

Annealing of the phosphorylated 'top' oligonucleotide to the corresponding 'bottom' oligonucleotide

Solution required:

2.1 10x Annealing buffer

    100mM     Tris.HCl (pH 8.0)
    100mM     MgCl2

The following were mixed in a 1.5ml screwcap Sarstedt tube:
10 pmoles (in 10ul) of phosphorylated 'top' oligonucleotide from step 1
10 pmoles of the corresponding 'bottom' oligonucleotide
2ul of 10x annealing buffer (2.1)
H2O to 20ul

The tube was placed in a boiling water bath. After 5 minutes heating was discontinued and the water
bath allowed to cool slowly to room temperature over a period of about 2 hours. Finally the contents of the
tube were collected at the bottom by centrifugation in a microfuge for 10 seconds. The annealed
oligonucleotides, at a concentration of 0.5pmole/ul, are then ready for use in ligations.

Vectorette units 1-8 were as defined in Table 1 hereinbefore. 
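
The 0.5pmole/ul figure quoted for the annealed oligonucleotides follows from the amounts mixed in Step 2. A short Python sketch of the calculation is given here for illustration; it assumes that annealing goes to completion, and the function names are hypothetical.

```python
def annealed_duplex_conc(top_pmol, bottom_pmol, volume_ul):
    """Duplex concentration in pmole/ul, limited by whichever strand is scarcer."""
    return min(top_pmol, bottom_pmol) / volume_ul


def volume_for(pmol_needed, conc_pmol_per_ul):
    """Volume in ul of annealed stock that delivers the requested amount."""
    return pmol_needed / conc_pmol_per_ul


conc = annealed_duplex_conc(top_pmol=10, bottom_pmol=10, volume_ul=20)
print(conc)                  # 0.5
print(volume_for(3, conc))   # 6.0 (ul carrying the 3 pmoles used in the Step 4 ligation)
```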



Step 3 



Preparation of DNA 

Cell line 7387 DNA was used. 

Buffers required:

3.1 10x EcoRI buffer

    100mM     Tris-HCl pH 7.5
    100mM     MgCl2
    500mM     NaCl



3.2 TE Buffer

    10mM      Tris-HCl pH 7.5
    1mM       EDTA



3.3 Agarose sample buffer

    15% (w/v)       Ficoll 400
    0.05% (w/v)     Bromophenol Blue
    0.05% (w/v)     Xylene Cyanol

    The sample buffer was made up in 1x TBE buffer (3.4 below).

3.4 10x TBE buffer:

    per litre:-
    Tris          109g
    EDTA          9.5g
    Boric Acid    55g

10x TBE buffer was diluted in water to 1x or 0.5x strength as required.

(i) The following conditions were employed for cleaving the 7387 cell line DNA with the restriction
endonuclease EcoRI:-
40ug of 7387 genomic DNA
40ul of 10x EcoRI buffer (3.1)
80 units of the restriction endonuclease
EcoRI (from Boehringer Mannheim)
H2O to 400ul

Digestion was carried out at 37°C for 2 hours. The reaction was terminated by heating the sample at
70°C for 10 minutes.

(ii) 5ul of the sample was checked by agarose gel electrophoresis to ensure complete digestion:-

Agarose gel electrophoresis

The gels were made 0.7% (w/v) agarose (Pharmacia) in 1x TBE (3.4) buffer and submerged in the same
strength TBE buffer for electrophoresis. To the sample (5ul), a fifth-volume of agarose sample buffer (3.3)
was added prior to loading the samples in the wells of the gel. Following electrophoresis the gel was
stained for 10 minutes by submerging in deionised water containing 1ug/ml ethidium bromide. The DNA
was visualised at 302nm using a transilluminator (Chromato-Vue C-63 transilluminator).

(iii)(a) The digested DNA was purified by phenol extraction:-

An equal volume of buffered redistilled phenol (BRL DNA grade phenol buffered with TE, 3.2) was
added to the restricted DNA and the mixture vortexed for 30 seconds. The white emulsion formed was
separated into two phases by microcentrifugation for 5 minutes. The aqueous phase was transferred to a
fresh tube. Any residual phenol was removed by first extracting the aqueous layer with an equal volume of
phenol:chloroform:isoamyl alcohol (25:24:1), and then with an equal volume of
chloroform:isoamyl alcohol (24:1).

(b) The DNA was finally purified by ethanol precipitation:
To the DNA solution, 3M sodium acetate was added to give a final concentration of 300 mM sodium
acetate. The DNA was precipitated by the addition of 2.5 volumes of 95% ethanol and cooling at -80°C for
30 minutes followed by centrifugation (microcentrifuge for 5 minutes, 12,000rpm). The pelleted DNA was
washed with 80% (v/v) cold ethanol and dried under vacuum. The DNA was dissolved in the TE buffer (3.2)
to give a concentration of 0.5ug/ul.
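
The digestion and precipitation steps above scale linearly with the amount of DNA, so the quantities can be derived rather than memorised. The Python sketch below is illustrative only. It assumes the proportions stated for the 40ug digest (2 units of enzyme and 10ul final volume per ug) and, because the volume of 3M sodium acetate is not given in this text, it solves for the volume that yields 300 mM; it also assumes that the 2.5 volumes of ethanol are measured against the sample after the acetate has been added. All function names are hypothetical.

```python
def ecori_digest(dna_ug, units_per_ug=2.0, final_ul_per_ug=10.0):
    """Scale the EcoRI digest of step 3(i) to a given mass of DNA."""
    final_volume_ul = dna_ug * final_ul_per_ug
    return {
        "buffer_10x_ul": final_volume_ul / 10,   # 10x buffer is one tenth of the final volume
        "enzyme_units": dna_ug * units_per_ug,
        "final_volume_ul": final_volume_ul,
    }


def precipitation_volumes(sample_ul, stock_m=3.0, final_m=0.3, ethanol_volumes=2.5):
    """Return (acetate_ul, ethanol_ul) for an ethanol precipitation.

    Solves final_m = stock_m * v / (sample_ul + v) for the acetate volume v.
    """
    acetate_ul = sample_ul * final_m / (stock_m - final_m)
    ethanol_ul = ethanol_volumes * (sample_ul + acetate_ul)
    return acetate_ul, ethanol_ul


print(ecori_digest(40))
acetate, ethanol = precipitation_volumes(100)
print(f"{acetate:.1f}ul of 3M sodium acetate, {ethanol:.1f}ul of 95% ethanol")
```

For 40ug of DNA the first function reproduces the 40ul of buffer, 80 units of enzyme and 400ul final volume given in step 3(i). For a 100ul sample the second gives about 11.1ul of acetate, that is one ninth of the sample volume.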


Step 4 



Ligating the annealed oligonucleotides to the cleaved DNA

Buffers and other standard solutions

4.1 10x ligation buffer

    200mM     Tris-HCl pH 7.4
    100mM     MgCl2
    100mM     DTT

4.2 10x ATP

    10mM      ATP

4.3

    0.5M      NaCl

The calculation of the pmoles of sticky ends in 1ug of cleaved DNA was as set out in 4.1(i) of Method I.
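
The calculation in 4.1(i) of Method I is not reproduced in this section. As an illustration only, the Python sketch below shows one way such an estimate could be made. It rests on two assumptions that are not taken from the specification: an average mass of 660 g/mol per base pair, and an average fragment length of 4^6 bp for an enzyme with a six-base recognition site. Under those assumptions it returns values close to the 0.74 pmoles per ug and 1.48 pmoles per 2ug quoted for the ligations.

```python
AVG_BP_MASS_G_PER_MOL = 660.0  # assumed average mass of one base pair


def sticky_end_pmoles(dna_ug, site_length_bp=6):
    """Estimate pmoles of fragment ends in a complete digest of dna_ug of DNA."""
    avg_fragment_bp = 4 ** site_length_bp                      # expected spacing of sites
    fragment_mol = dna_ug * 1e-6 / (avg_fragment_bp * AVG_BP_MASS_G_PER_MOL)
    return 2 * fragment_mol * 1e12                             # two ends per fragment


print(round(sticky_end_pmoles(1), 2))  # 0.74
print(round(sticky_end_pmoles(2), 2))  # 1.48
```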
Ligations

The following were mixed in a 0.5ml Sarstedt tube:

Cleaved 7387 DNA                           1.48 pmoles (2ug)
Annealed oligonucleotides (from step 2)    3 pmoles
10x ligation buffer (4.1)                  2ul
10x ATP (4.2)                              2ul
T4 DNA ligase (Boehringer Mannheim)        2 units
H2O final volume to                        20ul



The ligation mix was left at 25°C for 120 minutes. NaCl (4.3) was then added to the ligation mix to a
final concentration of 50mM. Any genomic DNA fragments ligated to other genomic DNA fragments were
recut by the addition of 2 units of EcoRI (Boehringer Mannheim) and incubation of the sample at 37°C for
30 minutes. Vectorette units were then ligated to these recut genomic DNA fragments by adding:
ATP to 1mM concentration (4.2)
1.5 pmoles of annealed oligonucleotides
1 unit of T4 DNA ligase
(Boehringer Mannheim)

and incubation at 25°C for 1 hour. After recutting the DNA at 37°C for 30 minutes with EcoRI, the restriction
endonuclease is not inactivated.

Therefore during the second and final ligation at 25°C for 60 minutes, any genomic DNA fragments
ligating to other genomic DNA fragments are presumably being recleaved in the same reaction, such that at
the end of this restriction/ligation reaction virtually 100% of the genomic DNA fragments have the vectorette
units at both ends. Ligation of the vectorette unit to cut genomic DNA inactivates that particular restriction
site.

The DNA (reaction mix) was then diluted to 10ng/ul. EcoRI vectorette libraries were constructed using
this method and vectorette units 1 to 8 together with 7387 cell line DNA.

A control EcoRI library was also constructed where the vectorette units were omitted from the ligations
described above.
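
The quantities in the Step 4 ligation can be cross-checked with a few lines of arithmetic. The Python sketch below is illustrative, with hypothetical function names. It works only from the figures listed above and neglects the small volume changes caused by the later additions.

```python
def molar_excess(vectorette_pmol, end_pmol):
    """Ratio of vectorette units to genomic sticky ends."""
    return vectorette_pmol / end_pmol


def final_concentration(stock, added_ul, final_volume_ul):
    """Concentration after dilution, in the same units as the stock."""
    return stock * added_ul / final_volume_ul


def volume_to_add(stock, target, current_volume_ul):
    """Volume of stock that brings current_volume_ul to the target concentration.

    Solves target = stock * v / (current_volume_ul + v) for v.
    """
    return current_volume_ul * target / (stock - target)


print(f"{molar_excess(3, 1.48):.1f}")              # 2.0: first ligation, about 2 vectorettes per end
print(f"{molar_excess(3 + 1.5, 1.48):.1f}")        # 3.0: after the second addition of vectorettes
print(final_concentration(10, 2, 20))              # 1.0 (mM ATP from 2ul of the 10mM stock)
print(f"{volume_to_add(500, 50, 20):.1f}")         # 2.2 (ul of 0.5M NaCl for 50mM in a 20ul mix)
```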
Step 5

Amplification applied to the vectorette libraries

10ng of each of the libraries constructed (for vectorette units 1-8 and the DNA minus vectorettes control
library) were subjected to amplification. The sequence shown in Figure 12 was amplified using
oligonucleotides 58 and 63 as amplimers and the amplification conditions used were as described in
Example 8.

15ul of the amplification product was analysed by agarose gel electrophoresis.

The result is as shown in Figure 14. The expected size (ca 600bp) product has been produced in all
reactions except where the substrate DNA is from the library constructed without vectorettes in the ligation
mix. This is as expected since the sequence of oligonucleotide 58 (one of the amplimers above) is within the
vectorette units.

These results indicate that the design of the vectorette units has no significant effect on the
amplification of the nucleotide sequence between the vectorette units and any desired genomic sequence
(ie. oligonucleotide 63 above).

The results are shown in Figure 14 which shows a photograph of an ethidium bromide stained gel
depicting the products obtained which are as set out below:-

1. Library constructed with 7387 DNA + vectorette unit 1
2. Library constructed with 7387 DNA + vectorette unit 2
3. Library constructed with 7387 DNA + vectorette unit 3
4. Library constructed with 7387 DNA + vectorette unit 4
5. Library constructed with 7387 DNA + vectorette unit 5
6. Library constructed with 7387 DNA + vectorette unit 6
7. Library constructed with 7387 DNA + vectorette unit 7
8. Library constructed with 7387 DNA minus vectorettes
9. φX174 HaeIII marker
10. φX174 HaeIII marker
11. Library constructed with 7387 DNA + vectorette unit 8

The expected size product is obtained in each case.



Example 10 

This example was effected according to Method X as hereinbefore described.

EcoRI vectorette libraries were constructed using this method and vectorette units 1 to 7 as herein
defined and 7387 cell line DNA.

A control EcoRI library was also constructed where the vectorette units were omitted from the ligations
described above.



Amplification applied to the vectorette libraries 

10ng of each of the libraries constructed (for vectorette units 1-7, and the DNA minus vectorettes
control library) were subjected to amplification. The sequence shown in Example 8 was amplified using
oligonucleotides 58 and 63 as amplimers. The amplification conditions described in Example 8 were used.

15ul of the product was analysed by agarose gel electrophoresis.
The result is as shown in Figure 15. The expected size (ca 600bp) product has been produced in all
reactions except where the substrate DNA is from the library constructed without vectorettes in the ligation
mix. This is as expected since the sequence of oligonucleotide 58 (one of the amplimers above) is within
the vectorette units.

The lanes of the photograph shown in Figure 15 are as follows:-
1. φX174 HaeIII marker
2. Amplification product from library 7387 DNA + vectorette unit 1
3. Amplification product from library 7387 DNA + vectorette unit 2
4. Amplification product from library 7387 DNA + vectorette unit 3
5. Amplification product from library 7387 DNA + vectorette unit 4
6. Amplification product from library 7387 DNA + vectorette unit 5
7. Amplification product from library 7387 DNA + vectorette unit 6
8. Amplification product from library 7387 DNA + vectorette unit 7
9. Amplification product from library 7387 DNA minus vectorettes.
10. φX174 HaeIII marker.

The expected result is obtained from each library.

These results indicate that the design of the vectorette units has no significant effect on the
amplification of the nucleotide sequence between the vectorette units and any desired genomic sequence
(ie oligonucleotide 63 above).

Example 11

This example was effected according to Method XI as hereinbefore described.
EcoRI vectorette libraries were constructed using this method and vectorette units 1 to 7 and 7387 cell
line DNA.

A control EcoRI library was also constructed where the vectorette units were omitted from the ligations
described above.

Amplification applied to the vectorette libraries 

10ng of each of the libraries constructed (for vectorette units 1-7, and the DNA minus vectorettes
control library) were subjected to polymerase chain reaction. The sequence shown in Example 1 was
amplified using oligonucleotides 58 and 63 as amplimers. The amplification conditions described in
Example 8 were employed.

15ul of the product was analysed by agarose gel electrophoresis.

The result is as shown in Figure 16. The expected size (ca 600bp) product has been produced in all
reactions, except where the substrate DNA is from the library constructed without vectorettes in the ligation
mix. This is as expected since the sequence of oligonucleotide 58 (one of the amplimers above) is within
the vectorette units.

The lanes of the photograph shown in Figure 16 are as follows:-
1. φX174 HaeIII marker
2. Amplification product from library 7387 DNA + vectorette unit 1
3. Amplification product from library 7387 DNA + vectorette unit 2
4. Amplification product from library 7387 DNA + vectorette unit 3
5. Amplification product from library 7387 DNA + vectorette unit 4
6. Amplification product from library 7387 DNA + vectorette unit 5
7. Amplification product from library 7387 DNA + vectorette unit 6
8. Amplification product from library 7387 DNA + vectorette unit 7
9. Amplification product from library 7387 DNA minus vectorettes
10. φX174 HaeIII marker.

These results indicate that the design of the vectorette units has no significant adverse effect on the
amplification of the nucleotide sequence between the vectorette units and any desired genomic sequence
(ie oligonucleotide 63 above).

Example 12 

This example was effected according to Method XII as hereinbefore described.

EcoRI vectorette libraries were constructed using this method and vectorette units 1 to 7 and 7387 cell
line DNA.

A control EcoRI library was also constructed where the vectorette units were omitted from the ligations
described above.

Amplification applied to the vectorette libraries 
10ng of each of the libraries constructed (for vectorette units 1-7, and the DNA minus vectorettes
control library) were subjected to amplification. The sequence shown in Example 1 was amplified using
oligonucleotides 58 and 63 as amplimers. The amplification conditions described in Example 8 were
employed.

15ul of the amplification product was analysed by agarose gel electrophoresis.

The result is as shown in Figure 17. The expected size (ca 600bp) product has been produced in all
reactions except where the substrate DNA is from the control library constructed without vectorettes in the
ligation mix. This is as expected since the sequence of oligonucleotide 58 (one of the amplimers above) is
within the vectorette units.

The lanes of the photograph shown in Figure 17 are as follows:-
1. φX174 HaeIII marker
2. Amplification product from library 7387 DNA + vectorette unit 1
3. Amplification product from library 7387 DNA + vectorette unit 2
4. Amplification product from library 7387 DNA + vectorette unit 3
5. Amplification product from library 7387 DNA + vectorette unit 4
6. Amplification product from library 7387 DNA + vectorette unit 5
7. Amplification product from library 7387 DNA + vectorette unit 6
8. Amplification product from library 7387 DNA + vectorette unit 7
9. Amplification product from library 7387 DNA minus vectorettes
10. φX174 HaeIII marker.

These results indicate that the design of the vectorette units has no significant effect on the
amplification of the nucleotide sequence between the vectorette units and any desired genomic sequence
(ie oligonucleotide 63 above).



Example 13 



This example was effected according to Method XIII as hereinbefore described.
EcoRI vectorette libraries were constructed using this method and vectorette units 1 to 7 and 7387 cell
line DNA.

A control EcoRI library was also constructed where the vectorette units were omitted from the ligation
reaction.

Amplification applied to the vectorette libraries

10ng of each of the libraries constructed (for vectorette units 1-7, and the DNA minus vectorettes
control library) were subjected to the amplification. The sequence shown in Example 1 was amplified using
oligonucleotides 58 and 63 as amplimers. The PCR conditions described in Example 13 were employed.
15ul of the amplification product was analysed by agarose gel electrophoresis.

The result is as shown in Figure 18. The expected size (ca 600bp) product has been produced in all
reactions except where the substrate DNA is from the library constructed without vectorettes in the ligation
mix. This is as expected since the sequence of oligonucleotide 58 (one of the amplimers above) is within
the vectorette units.

The lanes of the photograph shown in Figure 18 are as follows:-
1. φX174 HaeIII marker.
2. Amplification product from library 7387 DNA + vectorette unit 1
3. Amplification product from library 7387 DNA + vectorette unit 2
4. Amplification product from library 7387 DNA + vectorette unit 3
5. Amplification product from library 7387 DNA + vectorette unit 4
6. Amplification product from library 7387 DNA + vectorette unit 5
7. Amplification product from library 7387 DNA + vectorette unit 6
8. Amplification product from library 7387 DNA + vectorette unit 7
9. Amplification product from library 7387 DNA minus vectorettes.
10. φX174 HaeIII marker

These results indicate that the design of the vectorette units has no significant effect on the
amplification of the nucleotide sequence between the vectorette units and any desired genomic sequence
(see oligonucleotide 63 above).
Example 14

Five different EcoRI vectorette libraries were constructed using 7387 cell line DNA and vectorette unit 8
in the following way.

(1)
(a) One library was constructed using Method IX
(b) One library was constructed using Method X
(c) One library was constructed using Method XI
(d) One library was constructed using Method XII
(e) One library was constructed using Method XIII

(2) Prior to amplification, the libraries constructed in (1) above were subjected to a 3'-end blocking
reaction:-



Buffers and other standard solutions

7.2.1 T7 DNA polymerase buffer 5x

    200mM     Tris-HCl pH 7.5
    100mM     MgCl2
    250mM     NaCl

7.2.2 dNTPs-dATP

    1mM       dTTP
    1mM       dGTP
    1mM       dCTP

    made up in sterilised deionised water.

7.2.3 ddATP

1mM ddATP made up in sterilised deionised water.

Addition of a ddA residue to 3' ends of products from (1) above.

The following were mixed in a 0.5ml Sarstedt tube:
0.5ul of DNA from step (1) above
4ul of 5x T7 DNA polymerase buffer (7.2.1)
2ul dNTPs-dATP solution (7.2.2)
2ul ddATP solution (7.2.3)
1 unit of T7 DNA polymerase (Sequenase)
H2O to 20ul

The tube was maintained at 37°C for 30 minutes. The enzyme was then denatured by incubation at
70°C for 10 minutes.

The concentration of DNA in all the samples was then diluted in water to 10ng/ul.
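
The working concentrations in the 3'-end blocking reaction follow from the stock solutions 7.2.1 to 7.2.3 and the 20ul final volume. A small Python sketch of that dilution arithmetic is given here for illustration; the function and variable names are hypothetical.

```python
FINAL_VOLUME_UL = 20.0


def final_concentration(stock, added_ul, final_volume_ul=FINAL_VOLUME_UL):
    """Concentration after dilution, in the same units as the stock."""
    return stock * added_ul / final_volume_ul


# 4ul of the 5x buffer (7.2.1), concentrations in mM
buffer_5x_mm = {"Tris-HCl": 200, "MgCl2": 100, "NaCl": 250}
buffer_final_mm = {name: final_concentration(c, 4) for name, c in buffer_5x_mm.items()}
print(buffer_final_mm)                 # {'Tris-HCl': 40.0, 'MgCl2': 20.0, 'NaCl': 50.0}

# 2ul each of the 1mM (1000uM) nucleotide stocks (7.2.2 and 7.2.3), in uM
print(final_concentration(1000, 2))    # 100.0 for each of dTTP, dGTP, dCTP and for ddATP
```

With dATP omitted from the nucleotide mix and ddATP present at the same concentration as the other nucleotides, any extension that requires an A incorporates the chain-terminating ddA residue.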
10ng of each of the five libraries constructed were subjected to polymerase chain reaction. The
sequence shown in Example 8 was amplified using oligonucleotides 58 and 63 as amplimers. The
amplification conditions in Example 8 were employed.

15ul of the amplification product was analysed by agarose gel electrophoresis.
The result is as shown in Figure 19.

The lanes of the autoradiograph shown in Figure 19 are as follows:-
1. φX174 HaeIII marker.
2. Amplification product from 7387 DNA + vectorette unit 8 method IX + 3' end blocking
3. Amplification product from 7387 DNA + vectorette unit 8 method X + 3' end blocking
4. Amplification product from 7387 DNA + vectorette unit 8 method XI + 3' end blocking
5. Amplification product from 7387 DNA + vectorette unit 8 method XII + 3' end blocking
6. Amplification product from 7387 DNA + vectorette unit 8 method XIII + 3' end blocking

All products are of the expected size.

Example 15 

5 different EcoRI vectorette libraries were constructed using 7387 cell line DNA and vectorette unit 8.
(a) one library was constructed using method IX
(b) one library was constructed using method X
(c) one library was constructed using method XI
(d) one library was constructed using method XII
(e) one library was constructed using method XIII except that oligonucleotide 2 was non-phosphorylated.

Libraries (b) to (e) were prepared as described for methods IX to XII. Libraries (b), (c), (d), (e) and an aliquot
of library (a) were treated with ddATP and T7 DNA polymerase (Sequenase) to block free 3' ends.
Conditions were as described in Example 14.

The amplification was carried out on all of these libraries. The sequence shown in Figure 12 was amplified
using oligonucleotides 63 and 58. The conditions described in Example 8 were employed.

15ul of the amplification product was analysed by agarose gel electrophoresis (1.4% w/v gel).

The result obtained was as shown in Figure 20.

The lanes of the photograph shown in Figure 20 are as follows:-

1. Amplification product from 7387 DNA + vectorette unit 8 method IX (oligonucleotide 2 non-phosphorylated)
2. Amplification product from 7387 DNA + vectorette unit 8 method IX + 3' end blocking (oligonucleotide 2 non-phosphorylated)
3. Amplification product from 7387 DNA + vectorette unit 8 method X + 3' end blocking (oligonucleotide 2 non-phosphorylated)
4. Amplification product from 7387 DNA + vectorette unit 8 method XI + 3' end blocking (oligonucleotide 2 non-phosphorylated)
5. Amplification product from 7387 DNA + vectorette unit 8 method XII + 3' end blocking (oligonucleotide 2 non-phosphorylated)
6. Amplification product from 7387 DNA + vectorette unit 8 method XIII + 3' end blocking (oligonucleotide 2 non-phosphorylated)
7. φX174 HaeIII marker.

All products are of the expected size.

Example 16

This example was conducted to show that the ca 500bp product shown in Figures 13-20 is a product of
oligonucleotides 58 and 63 (see Example 8) and not a product of one of the amplimers.

10ng of each of the following vectorette libraries was used for each amplification:

A library prepared using 7387 DNA and vectorette units 1-8 in method VII.
A library prepared using 7387 DNA and vectorette unit 8 using method VII + 3'-end blocking (see
Example 14 for details).

A library prepared using 7387 DNA and vectorette unit 8 (lb oligonucleotide non phosphorylated) using
method VII + 3'-end blocking (see Example 15 for details).

A library prepared using 7387 DNA minus vectorette using method VII.

To show that products produced in Example 8 from libraries constructed with vectorette units 1-8 were
all identical and of the correct sequence, amplification products shown in Figure 16 (products 2-8) and the
product shown in lane 11 of Figure 14 were sequenced using oligonucleotides 5 and 66.

Figure 21 shows a photocopy of the autoradiograph produced from the sequencing of products 2-8
from Figure 16 with oligonucleotide 66.

The autoradiograph also shows the sequencing of parts of intron 8 and exon 9 of the phenylalanine
hydroxylase gene (phenylketonuria - PKU) as a control. The amplification products 2-8 appear from left to
right (as shown in Figure 16) and are all identical. The lane order is CGTA.

Figure 22 shows the nucleotide sequence data generated from the products described in Example 16.
Bases 1 to 16 are at the 3' end of the published PKU intron 8 sequence. Bases 17 to 452 constitute new
PKU intron 8 sequence data determined according to the method of the present invention.



Example 17

The ca 220bp product from Example 7(i), track (c), lanes 7, 8, 9, 10, 11 was sequenced using
oligonucleotides 62 and 67.

Thus the 220bp product from Example 7(i) was
1) amplified in 3 step cycles a) 60°C 2 minutes, b) 72°C 3 minutes and c) 91°C 2 minutes, 40 cycles
being performed (0.5ug DNA used)/100ul loaded/track (see Figure 24a)

2) the product from (1) above was then gel purified and reamplified in 2 step cycles a) 60°C 4
minutes and b) 91°C 2 minutes, 40 cycles being performed (see Figure 24b).

The positions of the oligonucleotides 58, 62 and 67 are shown in Figure 23. In Figure 25, the sequence
set out in (a) is the sequence read with oligonucleotide 62, the sequence set out in (b) being the sequence
read with oligonucleotide 67. In (a) and (b) the underlined sequence is the complementary reverse
sequence read with oligonucleotide 62 and 67. The overall 5'-3' sequence is depicted in Figure 25(c) in
which:-

Positions 2-31 read the 5'-3' sequence of oligonucleotide 62.
Positions 75-104 read the reverse complementary sequence of oligonucleotide 67.

Positions 95-104 (marked by a continuous line above the sequence) are the first 10 bases of the published
intron sequence (see Biochemistry 1985 25 p556-561 Kwok et al).

Positions 2-104 match the published sequence for PKU exon 1 and parts of the intron sequences.
Position 105 onwards is the new sequence data for PKU intron 1.
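The position bookkeeping used in Example 17 (one primer read 5'-3', the other appearing as its reverse complement) can be illustrated with a short sketch. The helper names `reverse_complement` and `find_primer` and the toy sequences are hypothetical and are not taken from the specification; the actual sequences of oligonucleotides 62 and 67 are not reproduced here.

```python
# Illustrative sketch only: helper names and toy sequences are assumptions,
# not part of the original disclosure.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T/N only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_primer(read: str, primer: str):
    """Locate a primer in a read on either strand.

    Returns (start, end, strand) with 1-based inclusive positions,
    or None if the primer is absent. The forward strand is tried first.
    """
    for strand, query in (("+", primer), ("-", reverse_complement(primer))):
        idx = read.find(query)
        if idx != -1:
            return idx + 1, idx + len(query), strand
    return None


if __name__ == "__main__":
    toy_read = "TTTGCTTAAA"   # hypothetical read
    toy_primer = "AAGC"       # hypothetical primer, present as reverse complement
    print(find_primer(toy_read, toy_primer))
```

A primer reported on the "-" strand corresponds to the situation described for oligonucleotide 67, whose reverse complementary sequence is read in the overall 5'-3' sequence.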

Example 18 

Exon V of the alpha antitrypsin gene was subjected to the method of the present invention (see Figure
26) using the following oligonucleotides:-

58 (universal vectorette primer), 60 (universal vectorette nested oligonucleotide), 68 (exon V 5'), 69 (exon V
5' nested) and 70 (exon V 3').

The vectorette library was made with EcoRI cut genomic DNA with annealed vectorette oligonucleotides
(87:40 as herein defined) and ligated.
The amplification was effected as follows:-

100 ng of the vectorette library was used in a first round of amplification as follows:-



100 ng Vectorette Library DNA
Oligonucleotides 58 + 68 (100 pmoles)
10 ul Buffer [500 mM KCl, 100 mM Tris, 14 mM MgCl2, 0.1% gelatin]
100 uM dNTPs
2 U Taq polymerase
Make up volume to 100 ul with double distilled water



Conditions of reaction:

Initial denaturation step: 93°C for 10 minutes
39 cycles of
93°C 2 minutes
65°C 2 minutes
72°C 8 minutes
Final cycle: 72°C for 10 minutes

The result is shown in Figure 27(a) in lane 4. Lane 3 represents a control amplification reaction using
oligonucleotides 68 and 70 to produce a ca 200bp product.

The primary round of amplification gave a smear of the approximate size expected. The primary
reaction mix was then diluted as follows:-

1 ul of the product of lane 4 in Figure 27(a) was made up to 1 ml with ddH2O
(i.e. 10^3 dilution)

1 ul of this was used in subsequent PCRs (total = 100 ul)
(i.e. total dilution = 10^5)
and reamplified using a range of primer pairs as follows:-



2nd PCR (with nested oligonucleotides)

1 ul of the product of lane 4 in Figure 27(a) (10^3 dilution) DNA
100 pmoles oligonucleotides 60 + 69
10 ul 10X Buffer
100 uM dNTPs
2 units Taq polymerase
Make up volume to 100 ul with double distilled water

10X buffer is:
500 mM KCl
100 mM Tris pH 8.3
10 mM MgCl2
0.1% gelatin

Conditions of reaction:

Initial denaturation step: 95°C for 10 minutes
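The two-step dilution of the primary reaction mix in Example 18 (1 ul made up to 1 ml, then 1 ul of that in a 100 ul reaction) is simple arithmetic, sketched below. The function names are hypothetical, and volumes are assumed to be expressed in microlitres.

```python
# Illustrative sketch: function names are assumptions, volumes are in ul.
from fractions import Fraction


def dilution_factor(aliquot_ul, final_ul):
    """Fold dilution when an aliquot is made up to a final volume."""
    return Fraction(final_ul) / Fraction(aliquot_ul)


def total_dilution(steps):
    """Multiply the fold dilutions of successive (aliquot, final volume) steps."""
    total = Fraction(1)
    for aliquot_ul, final_ul in steps:
        total *= dilution_factor(aliquot_ul, final_ul)
    return total


if __name__ == "__main__":
    # 1 ul -> 1 ml (1000 ul), then 1 ul of that in a 100 ul PCR.
    print(total_dilution([(1, 1000), (1, 100)]))
```

Exact fractions are used rather than floats so that the product of several steps does not pick up rounding error.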

The secondary amplification product is shown as a 3.0 kb product in lanes 1, 2 and 3 of Figure 27(b).
Figure 27(b) is a photograph of an agarose gel showing the product of secondary amplification between
oligonucleotides 60 and 69.

Lane 1 the 3.0 kb product of secondary amplification between oligonucleotides 60 and 69
Lane 2 the 3.0 kb product of secondary amplification between oligonucleotides 60 and 69
Lane 3 the 3.0 kb product of secondary amplification between oligonucleotides 60 and 69
Lane 4 the marker φX174 cleaved with Hae III and the marker λ cleaved with Hind III.

The gel was blotted onto a nylon membrane and probed with an α-1 antitrypsin probe. A single band
of the correct size (3.0 kb) was observed and is shown in Figure 27(c).

Figure 27(c) is an autoradiograph of a nylon filter (blotted from the gel shown in Figure 27(b)) which had
been probed with an α-1 antitrypsin probe. The position of DNA markers is shown.

Lane 1 the 3.0 kb product of secondary amplification between oligonucleotides 60 and 69
Lane 2 the 3.0 kb product of secondary amplification between oligonucleotides 60 and 69
Lane 3 the 3.0 kb product of secondary amplification between oligonucleotides 60 and 69

Example 19 

05 is an anonymous genomic sequence which has been cloned as a 474 bp Xba I-EcoRI fragment.
The sequence of 05 has been determined by standard methods (Newton et al., Nucleic Acids Research 16,
8233-8243, 1988) and is shown in Figure 29.
Two oligonucleotide primers, 71 and 72, were synthesised.
71 5' AAGTTTGAGCATAGGAAAAGTTCTGTGCCC
72 5' AGTTCTGTGCCCAAAATTGCATCCAAG
The position of oligonucleotides 71 and 72 is indicated on Figure 29.

Genomic DNA was isolated from the lymphoblastoid cell line GM 7387 (NIGMS Human Genetic
Mutant Cell Repository) and aliquots (20 ug) were digested with the following restriction enzymes: Dra I, Bcl
I, Hind III and Hae III. Vectorette libraries were then prepared according to Method X.

Primary amplifications were performed on aliquots of each library as described hereinunder.

The following were mixed in a 0.5 ml Sarstedt tube. 

10 ng of constructed vectorette library
100 pmole oligonucleotide 58
100 pmole oligonucleotide 71
100 uM dNTPs
10 ul 10X Buffer

10X Buffer is:
500 mM KCl
100 mM Tris-HCl pH 8.3
10 mM MgCl
0.1% gelatin.

The volume was adjusted to 100 ul with sterile, double distilled water. 50 ul of light mineral oil (Sigma) was
gently pipetted over the reaction mix in order to prevent evaporation.
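As a worked illustration of the reaction set-up, the final concentration of each buffer component follows from the ratio of stock volume to total volume. The sketch below assumes that 10 ul of the 10X Buffer is made up to 100 ul, as in this Example; the function name and dictionary keys are hypothetical.

```python
# Illustrative sketch: names and dictionary keys are assumptions.
def final_concentrations(stock, stock_volume_ul, total_volume_ul):
    """Scale each stock concentration by stock volume / total volume."""
    factor = stock_volume_ul / total_volume_ul
    return {name: conc * factor for name, conc in stock.items()}


# 10X Buffer composition as listed in the text.
BUFFER_10X = {
    "KCl_mM": 500,
    "Tris_HCl_mM": 100,
    "MgCl_mM": 10,
    "gelatin_percent": 0.1,
}

if __name__ == "__main__":
    for name, value in final_concentrations(BUFFER_10X, 10, 100).items():
        print(f"{name}: {value:g}")
```

Under that assumption the working (1X) reaction contains one tenth of each listed concentration.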

The tube was then heated to 96°C (in a Techne Programmable Dri-Block PHC-1 which had been
equilibrated at 96°C). The DNA was allowed to denature for 10 minutes at 96°C. The block was allowed to
cool to 90°C. Two units of Taq polymerase (Cetus) were added to the reaction mix. The following
temperatures and times were employed to amplify the sequence between oligonucleotides 58 and 71.
92°C 2 minutes
65°C 2 minutes
72°C 5 minutes

40 cycles were performed. During the final cycle, the reaction mix was held at 72°C for 10 minutes.
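For planning purposes, the programmed hold time of such a cycling protocol can be totalled as sketched below. This is only an estimate under stated assumptions: ramp times between temperatures and the cooling step before enzyme addition are ignored, the final cycle is taken to replace its last step with the extended hold, and the function name is hypothetical.

```python
# Illustrative sketch: ignores ramp times; function name is an assumption.
def program_minutes(initial_denature_min, step_minutes, cycles, final_hold_min):
    """Total programmed hold time.

    The last cycle is assumed to use `final_hold_min` in place of its
    final (extension) step, matching "during the final cycle, the
    reaction mix was held at 72°C for 10 minutes".
    """
    per_cycle = sum(step_minutes)
    last_cycle = per_cycle - step_minutes[-1] + final_hold_min
    return initial_denature_min + (cycles - 1) * per_cycle + last_cycle


if __name__ == "__main__":
    # 10 min denaturation, then 40 cycles of 2 + 2 + 5 minutes.
    print(program_minutes(10, [2, 2, 5], 40, 10), "minutes")
```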

Aliquots (15 ul) were withdrawn from the reaction mixtures. Dye loading mix (5 ul) was added and the
samples were analysed on 1.4% agarose gels.
Dye Loading Mix
15% (w/v) Ficoll 400
0.05% (w/v) Bromophenol Blue
0.05% (w/v) Xylene Cyanol
Dissolved in 1X TBE

A secondary amplification was performed on aliquots of the primary reaction mixtures.

An aliquot of the primary reaction mixture (2 ul) was diluted to 400 ul with water. An aliquot of the diluted
material (2 ul) was placed in a 0.5 ml Sarstedt tube.
The following were also added to the tube.

100 pmole oligonucleotide 60
100 pmole oligonucleotide 72
100 uM dNTPs
10 ul 10X Buffer

10X Buffer is:
500 mM KCl
100 mM Tris HCl pH 8.3
10 mM MgCl
0.1% gelatin

The volume was made up to 100 ul with sterile, double distilled water. 50 ul of light mineral oil (Sigma)
was gently pipetted over the reaction mix in order to prevent evaporation.

The tube was then heated to 96°C (in the Techne Programmable Dri-Block PHC-1 which had been
equilibrated at 96°C). The DNA was allowed to denature for 10 minutes at 96°C. The block was allowed to cool
to 90°C. Two units of Taq polymerase (Cetus) were added to the reaction mix. The following temperatures
and times were employed to amplify the sequence between oligonucleotides 60 and 72.
92°C 2 minutes
65°C 2 minutes
72°C 5 minutes

37 cycles were performed. During the final cycle, the reaction mixture was held at 72°C for 10 minutes.

Aliquots (15 ul) were withdrawn from the reaction mixtures. Dye loading mix (5 ul) was added and the
samples were analysed on a 1.4% agarose gel.
Dye Loading Mix
15% (w/v) Ficoll 400
0.05% Bromophenol Blue
0.05% Xylene Cyanol
Dissolved in 1X TBE

The libraries (Hae III, Hind III, Bcl I, Dra I) gave distinct products in the size range 0.4-1.5 kb (Figure 30).
From the observed size of the products, a provisional restriction map could be prepared (Figure 31). The
products were further analysed by restriction digests. Authentic products should be cleaved to give the
predicted size fragments. Thus, the 0.82 kb Bcl I product could be cleaved by Hind III to give a product of
0.78 kb and could also be cleaved by Hae III to give a fragment of 0.75 kb. The analysis of the 0.82 kb Bcl I
product by Hind III and Hae III is shown in Figure 30. The predicted products were observed and the Bcl I
product was then eluted for sequence analysis.
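The reasoning behind the restriction check can be sketched as follows: if a site lies a known distance from one end of a linear product, digestion should yield fragments whose sizes are fixed by that distance. The function name `digest` is hypothetical, and the cut positions used below (780 bp for Hind III, 750 bp for Hae III, measured from one end of an 820 bp product) are assumptions read off the approximate sizes quoted above; the sizes of the small complementary fragments are inferred, not stated in the text.

```python
# Illustrative sketch: function name and cut positions are assumptions.
def digest(length_bp, cut_positions):
    """Fragment sizes, in order, after cutting a linear molecule.

    Positions are distances in bp from one end; positions outside
    the open interval (0, length_bp) are ignored.
    """
    cuts = sorted(p for p in set(cut_positions) if 0 < p < length_bp)
    edges = [0] + cuts + [length_bp]
    return [b - a for a, b in zip(edges, edges[1:])]


if __name__ == "__main__":
    print("Hind III:", digest(820, [780]))
    print("Hae III:", digest(820, [750]))
```

Comparing the largest predicted fragment with the band observed on the gel is the check applied to the Bcl I product here.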

Thus in Figure 30 the product resulting from amplification of a Hae III vectorette library is shown in
Lane 2, the product resulting from amplification of a Hind III vectorette library is shown in Lane 3, the
product resulting from amplification of a Bcl I vectorette library is shown in Lane 4 and the product resulting
from amplification of a Dra I vectorette library is shown in Lane 5. Lane 7 shows the fragments obtained
after digestion of the Bcl I product with the restriction enzyme Hae III. Lane 8 shows the fragments obtained
after digestion of the Bcl I product with the restriction enzyme Hind III. Lane 9 shows the Bcl I product.
Lanes 1 and 6 contain 1 kb DNA markers (BRL).
Restriction Digest Analysis of Vectorette Products.

16 ul Amplification Reaction Mixture
2 ul 10X Restriction Buffer
2 ul Restriction Enzyme

The sample was incubated at 37°C for 1 hour. 5 ul of Dye Loading Mix was added to the sample and
the products were analysed on a 1.4% agarose gel.

SEQUENCING 

The Bcl I product was eluted from the agarose gel and sequenced using oligonucleotide 59 and
standard methodology (Newton et al., Nucleic Acids Research 16, 8233-8243, 1988). The sequence is
illustrated in Figure 31. A further primer (73) was designed from this sequence and used in subsequent
walks.

73 5' GGCCTTTGANNAAGAGAAGAGTCAAGGATG

Examination of the sequence confirms the presence of the Hae III and Hind III sites predicted by walks
according to the method of the present invention.



Example 20 



Examination of the Chlamydia genome using the method of the present invention.



Chlamydia trachomatis serotype L2 was grown in cycloheximide treated McCoy cells. Elementary
bodies were purified on glycerol-tartrate gradients yielding 10« elementary bodies. DNA was extracted by 
treatment with proteinase K/SDS, phenol/chloroform and recovered by ethanol precipitation. 90 ug of DNA
was recovered of which 900ng was estimated to be Chlamydial DNA and the remainder was mouse DNA
derived from the McCoy cells.

DNA (5 ug, equivalent to 50 ng Chlamydial DNA) was digested with either EcoRI or BamHI. Vectorette
libraries were prepared according to Method X using the appropriate oligonucleotides (27, 40 for EcoRI; 31,
40 for BamHI). The libraries were made up to a final volume of 100 ul.

Two oligonucleotide primers were synthesised based on the consensus sequence of MOMP (Major
Outer Membrane Protein) of Chlamydia L2 (Stephens et al, J Bacteriology 169, 3879-3885, 1987).
74 5' CTGCTCACGTAAATGCACAATTCCG
75 5' AACCGAGAGCAAAAGCCATATTGGC
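Basic properties of the two MOMP primers can be tabulated with a short sketch. The sequences are copied as printed above; the function names are hypothetical. The Wallace rule (2°C per A/T, 4°C per G/C) is included only as a rough guide: it was devised for short oligonucleotides and is not the method used in the specification to choose the annealing temperature.

```python
# Illustrative sketch: function names are assumptions; sequences as printed.
OLIGO_74 = "CTGCTCACGTAAATGCACAATTCCG"
OLIGO_75 = "AACCGAGAGCAAAAGCCATATTGGC"


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Rough melting temperature in °C by the Wallace rule: 2(A+T) + 4(G+C)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    for name, seq in (("74", OLIGO_74), ("75", OLIGO_75)):
        print(name, len(seq), f"{gc_fraction(seq):.2f}", wallace_tm(seq))
```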



Amplification

The following were mixed in a 0.5 ml Sarstedt tube.

10 ul EcoRI vectorette library
20 ul 5X Buffer
100 pmole Oligonucleotide 58
100 pmole Oligonucleotide 74
200 uM dNTPs

5X Buffer is:
335 mM Tris HCl pH 8.5
83 mM NH4SO4
10 mM MgCl
50 mM 2-mercaptoethanol
0.8 mg/ml Bovine Serum Albumin





The volume was adjusted to 100 ul with sterile, double distilled water. 50 ul of light mineral oil (Sigma)
was gently pipetted over the reaction mix in order to prevent evaporation.

The tube was then heated to 96°C (in the Techne Programmable Dri-Block PHC-1 which had been
equilibrated to 96°C). The DNA was allowed to denature for 10 minutes at 96°C. The block was then
allowed to cool to 90°C. Two units of Taq polymerase (Cetus) were added to the reaction mix. The following
temperatures and times were employed to amplify the sequence between oligonucleotides 58 and 74.
92°C 0.7 minutes
65°C 0.7 minutes
72°C 1 minute

40 cycles were performed. 10% of the reaction mix was removed for analysis on agarose gel. The
results are shown in Figure 32 in which:
Lane 1 shows the marker φX174 cleaved with Hae III.

Lane 2 shows the product of amplification between oligonucleotides 58 and 74 using 10 ul of an EcoRI
library.

Lane 3 shows the product of amplification between oligonucleotides 58 and 74 using 1 ul of an EcoRI
library.

The product from the amplification of an EcoRI library with oligonucleotides 58 and 74 was purified.
Asymmetric amplification (Proc. Natl. Acad. Sci. USA Vol 85 pages 7652-7656, October 1988, U.B.
Gyllensten and H. Erlich) of the product was performed with 50 pmole oligonucleotide 74 and
oligonucleotide 58 for 35 cycles as described above. The amplified product was purified on a G50 spun
column (as described in Molecular Cloning - A laboratory manual; Maniatis T., Fritsch E.F., Sambrook J.,
Cold Spring Harbor Laboratory 1982) and sequenced with oligonucleotide 59 as described in Example 17.
The sequence is shown in Figure 33. Digestion of an aliquot of the PCR product with either Hinf I or Cla I
confirms the presence of these restriction sites (Figure 34).

In Figure 34 the lanes of the photograph are as follows:-
Lane 1 φX174 Hae III DNA marker
Lane 2 EcoRI product digested with Cla I
Lane 3 EcoRI product
Lane 4 EcoRI product digested with Hinf I
Lane 5 φX174 Hae III DNA marker
Linear amplification applied to the Chlamydia genome

The following were placed in a 0.5 ml Sarstedt tube

10 ul Bam HI vectorette library
20 ul 5X Buffer (defined in Example 20)
100 pmole Oligonucleotide 75
200 uM dNTPs



The volume was adjusted to 100 ul with sterile, double distilled water. 50 ul of light mineral oil (Sigma)
was gently pipetted over the reaction mix in order to prevent evaporation.

The tube was then heated to 96°C (in the Techne Programmable Dri-Block PHC-1 which had been
equilibrated at 96°C). The DNA was allowed to denature for 10 minutes at 96°C. The block was then allowed
to cool to 90°C. Two units of Taq polymerase (Cetus) were added to the reaction mix. The following
temperatures and times were employed to amplify the sequence between oligonucleotide 75 and the
BamHI site.
93°C 0.7 minutes
65°C 0.7 minutes
72°C 1.5 minutes

40 cycles were performed. 100 pmole oligonucleotide 58 and two units of Taq polymerase were then
added to the reaction mix and amplification was continued for a further 20 cycles as described above. An
aliquot of the reaction mix was analysed by agarose gel electrophoresis (Figure 35).
An additional reaction was performed in which a BamHI library was amplified using oligonucleotides 75
and 58 under the reaction conditions described above. An aliquot of the reaction mix was analysed by
agarose gel electrophoresis (Figure 35).
In Figure 35:-
Lane 1 shows the marker φX174 cleaved with Hae III.
Lane 2 shows the product of linear amplification of a Bam HI vectorette library using oligonucleotide
75.

Lane 3 shows the product resulting from 20 cycles of secondary amplification with oligonucleotides
58 and 75.

Lane 4 shows the product resulting from 35 cycles of secondary amplification with oligonucleotides
58 and 75.

Lane 5 shows the product resulting from 40 cycles of primary amplification with oligonucleotides 58
and 75.
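The difference between the single-primer (linear) phase and the two-primer (exponential) phase can be made concrete with an idealised calculation. The sketch below assumes perfect efficiency by default and no plateau, which real reactions do not achieve, so the numbers are upper bounds rather than expected yields; the function names are hypothetical.

```python
# Illustrative sketch: idealised model, function names are assumptions.
def linear_copies(cycles, efficiency=1.0):
    """New strands made per template strand with a single primer."""
    return cycles * efficiency


def exponential_fold(cycles, efficiency=1.0):
    """Fold increase with two primers; doubles each cycle at efficiency 1."""
    return (1 + efficiency) ** cycles


if __name__ == "__main__":
    linear = linear_copies(40)        # 40 cycles with one primer only
    fold = exponential_fold(20)       # 20 further cycles with both primers
    print(f"linear phase: {linear:g} copies per template")
    print(f"exponential phase: {fold:g}-fold")
    print(f"combined upper bound: {linear * fold:g}")
```

The contrast illustrates why the product of the linear phase alone is compared on the gel with the products obtained after secondary amplification.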



Example 22

A human genomic library was constructed using the cosmid vector pCos EMBL2 according to
published procedures (PFR Little (1987) in DNA Cloning - A Practical Approach Vol III p 19-42, edited by DM
Glover). The library was screened by hybridisation to the probe KM19 (X Estivill et al (1987) Nature 326,
840-845). Several positive clones were identified and one clone, 4.17, was selected which contained the
KM19 sequence in a 40 kb insert.

Cosmid 4.17 was digested with Hind III and a vectorette library was prepared according to Method X.

Two oligonucleotide primers, 76 and 77, were synthesised.
76 5' GGAAGGCCTTCAAAATTAACAGTGTAGCC
77 5' GCTGCATCATATAAGTTGCC

The positions of these primers are shown in Figure 36(a).

PCR amplification was performed on aliquots of the library as described hereinunder.
The following were mixed in a 0.5 ml Sarstedt tube.
The following were mixed in a 0.5 ml Sarstedt tube. 

0.1 pg Cosmid vectorette library
100 pmoles Oligonucleotide 58
100 pmoles Oligonucleotide 76
100 uM dNTPs
10 ul 10X buffer

10X buffer is:
500 mM KCl
100 mM Tris HCl pH 8.3
10 mM MgCl2
0.1% gelatin

A second reaction was set up containing the following in a 0.5 ml Sarstedt tube.

0.1 pg Cosmid vectorette library
100 pmoles Oligonucleotide 58
100 pmoles Oligonucleotide 77
100 uM dNTPs
10 ul 10X buffer



The volume of the reaction mixes was adjusted to 100 ul with sterile double distilled water. 50 ul of
light mineral oil (Sigma) was gently pipetted over the reaction mix in order to prevent evaporation. The tubes
were then placed in a thermal cycler (Techne Programmable Dri-Block PHC-1) which had been equilibrated
at 96°C. The block was allowed to cool to 90°C. Two units of Taq polymerase (Cetus) were added to the
reaction mix. The following temperatures and times were employed to amplify the sequences between
oligonucleotides 58 and 76 and between 58 and 77.
92°C 2 minutes
65°C 2 minutes
72°C 5 minutes

40 cycles were performed. During the final cycle, the reaction mix was held at 72°C for 10 minutes.

Aliquots (15 ul) were withdrawn from the reaction mixtures. Dye loading mix (5 ul) was added and the
samples were analysed on 1.4% agarose gels.
Dye loading mix
15% (w/v) Ficoll 400
0.05% (w/v) Bromophenol Blue
0.05% (w/v) Xylene Cyanol
Dissolved in 1X TBE

The results are shown in Figure 36(b).
Lane 1 φX174 Hae III DNA marker
Lane 2 230 bp product (oligonucleotide 58, oligonucleotide 76)
Lane 3 1200 bp product (oligonucleotide 58, oligonucleotide 77)
Lane 4 φX174 Hae III DNA markers



Example 23

The following oligonucleotides were synthesised.

78 5' CTCTCCCTTCTCCGGTGATGCCGGCCACGATGCGTCCGGCGGTCCT(rrCCTTC 

A modified type E oligonucleotide containing a sequence representing nucleotide residues 385-414 of
the tetracycline resistance gene of plasmid pBR322 (T Maniatis, EF Fritsch and J Sambrook (1982)
Molecular Cloning - A laboratory manual p 479-503). To be used in conjunction with type 0 oligonucleotides.

79 5' CCGGTGATGCCGGCCACGATGCGTCCGGCG

A modified universal vectorette primer representing nucleotide residues 385-414 of the tetracycline
resistance gene of plasmid pBR322 (Maniatis et al, Molecular Cloning - A laboratory manual p 479-503). To
be used with oligonucleotide 78 and oligonucleotides 80/81, 82/81 and 83/81 described hereinafter.

80 5' AATTGGCGGCCGCCATCCTAATTCTGTCGAAGGTAAGGAACGGAGGAGAGAACT
A modified type 0 oligonucleotide containing an EcoRI cohesive end, a Not I recognition site and a
Fok I recognition site. To be used with oligonucleotide 81.

81 5' AGTTCTCCCGGTGATGCCGGCCACGATGCGTCCGGCGGATGGCGGCCGCC
A modified type E oligonucleotide.
82 5' AGCTGGCGGCCGCCATCCTAATTCTGTCGAAGGTAAGGAACGGAGGAGAGAACT

A modified type 0 oligonucleotide containing a Hind III cohesive end, a Not I recognition site and a Fok
I recognition site. To be used with oligonucleotide 81.

83 5' AGCTGGCGGCCGCCATCC-NH2

A modified type A oligonucleotide containing a Hind III cohesive end with an aminohexyl modification
blocking the 3' terminus. To be used with oligonucleotide 81.

84 5' GGCTCAACAGTCAGTTTGAACTAGC

A primer adjacent to an EcoRI site in the KM19 region.

Restriction maps appropriate to each of the following amplifications are shown in Figure 37 in which

(a) shows a restriction map of a segment of genomic DNA ligated to an EcoRI vectorette. The
positions of oligonucleotides 27, 76, 78 and 79 are indicated.

(b) shows a restriction map of a segment of genomic DNA ligated to an EcoRI vectorette. The
positions of oligonucleotides 78, 79 and 84 are indicated.

(c) shows a segment of the Phenylalanine Hydroxylase gene exon 9 ligated to an EcoRI vectorette.
The positions of oligonucleotides 63 and 79 are indicated.

(d) shows a restriction map of the segment of genomic DNA (described in 37(a) above) ligated to a
modified EcoRI vectorette. The positions of oligonucleotides 76, 79, 80 and 81 are indicated.

(e) shows a restriction map of a segment of genomic DNA (described in 37(b) above) ligated to a
modified EcoRI vectorette. The positions of oligonucleotides 79, 80, 81 and 84 are indicated.

(f) shows a segment of the Phenylalanine Hydroxylase gene exon 9 ligated to a modified EcoRI
vectorette. The positions of oligonucleotides 63, 79, 80 and 81 are indicated.

(g) shows a restriction map of a segment of genomic DNA ligated to a modified Hind III vectorette
library. The positions of oligonucleotides 76, 78 and 79 are indicated.

(h) shows a restriction map of a segment of genomic DNA ligated to a modified Hind III vectorette
library. The positions of oligonucleotides 78, 79, 81 and 82 are indicated.

(i) shows a restriction map of a segment of genomic DNA ligated to a modified Hind III vectorette
library. The positions of oligonucleotides 76, 79, 81 and 83 are indicated.

(a) An EcoRI vectorette library was prepared from EcoRI digested genomic DNA (cell line 7387) and
oligonucleotides 27 and 79 according to Method X. An aliquot of the library (1 ng) was amplified with
oligonucleotides 76 and 79. Amplification was performed as described in Example 8. An aliquot of the
reaction mix (15 ul) was analysed on a 1.4% agarose gel. The results are shown in Figure 38.

Lane 1 φX174 Hae III DNA markers
Lane 2 870 bp product resulting from amplification between oligonucleotides 76 and 79.

(b) An aliquot (1 ng) of the EcoRI vectorette library described in the previous example was amplified
with oligonucleotides 79 and 84. Amplification was performed as described in Example 8. An aliquot of the
reaction mix (15 ul) was analysed on a 1.4% agarose gel. The result is shown in Figure 38.

Lane 3 520 bp product resulting from amplification between oligonucleotides 79 and 84.

(c) An aliquot (1 ng) of the EcoRI vectorette library described in the previous examples was
amplified with oligonucleotides 63 and 79. Amplification was performed as described in Example 8. An
aliquot of the reaction mix (15 ul) was analysed on a 1.4% agarose gel. The result is shown in Figure 38.
Lane 4 600 bp product resulting from amplification between oligonucleotides 63 and 79.

(d) An EcoRI vectorette library was prepared from EcoRI digested genomic DNA (cell line 7387) and
oligonucleotides 80 and 81 according to Method X. An aliquot of the library (1 ng) was amplified with
oligonucleotides 76 and 79. Amplification was performed as described in Example 8. An aliquot of the
reaction mix (15 ul) was analysed on a 1.4% agarose gel. The results are shown in Figure 38.

Lane 5 870 bp product resulting from amplification between oligonucleotides 76 and 79.

(e) An aliquot of the library described in (d) above was amplified with oligonucleotides 79 and 84.
Amplification was performed as described in Example 8. An aliquot of the reaction mix was analysed on a
1.4% agarose gel. The result is shown in Figure 38.

Lane 6 520 bp product resulting from amplification between oligonucleotides 79 and 84.

(f) An aliquot of the library described in (d) above was amplified with oligonucleotides 63 and 79.
Amplification was performed as described in Example 8. An aliquot of the reaction mix was analysed on a
1.4% agarose gel. The result is shown in Figure 38.

Lane 7 600 bp product resulting from amplification between oligonucleotides 63 and 79.

(g) A Hind III vectorette library was prepared from Hind III digested genomic DNA (cell line 7387) and
oligonucleotides 30 and 81 according to Method X. An aliquot of the library (1 ng) was amplified with
oligonucleotides 76 and 79. Amplification was performed as described in Example 8. An aliquot of the
reaction mix (15 ul) was analysed on a 1.4% agarose gel. The results are shown in Figure 38.

Lane 8 230 bp product resulting from amplification between oligonucleotides 76 and 79.

(h) A Hind III vectorette library was prepared from Hind III digested genomic DNA (cell line 7387) and
oligonucleotides 82 and 81 according to Method X. An aliquot of the library (1 ng) was amplified with
oligonucleotides 76 and 79. Amplification was performed as described in Example 8. An aliquot of the
reaction mix (15 ul) was analysed on a 1.4% agarose gel. The results are shown in Figure 38.

Lane 9 230 bp product resulting from amplification between oligonucleotides 76 and 79.

(i) A Hind III vectorette library was prepared from Hind III digested genomic DNA (cell line 7387) and
oligonucleotides 83 and 81 according to Method X. An aliquot of the library (1 ng) was amplified with
oligonucleotides 76 and 79. Amplification was performed as described in Example 8. An aliquot of the
reaction mix (15 ul) was analysed on a 1.4% agarose gel. The results are shown in Figure 38.

Lane 9 230 bp product resulting from amplification between oligonucleotides 76 and 79.
Lane 10 φX174 Hae III DNA markers.

Example 24

The following oligonucleotides were synthesised:
85 S' ATTCTijTCCAGGAAACTTTGTQrrrrGTCA 
86 5' AGCCGAAGACAAAGGGCATAATCTC

87 5' TCACTACCAGCTTTCCCCCACTTCCCTGGC

88 S' CCCTAAATTAGTCTCAGCTCCAGGTAAGiST 

DNA was extracted from cell line 7382 (previously demonstrated to possess the -M- haplotype for the
KM19 Pst I polymorphism) and digested with Pst I. After extraction with phenol/chloroform, DNA was
recovered by ethanol precipitation (see Figure 39(a)).



Circularisation

The following were added to a 0.5 ml Sarstedt tube:

100 ng Pst I digested 7382 DNA
10 ul 10X ligation buffer
10 ul 10 mM ATP
0.5 ul T4 DNA ligase (Boehringer Mannheim 8 units/ul).

The volume was adjusted to 100 ul with sterile, double distilled water and incubated for 2 hours at
25°C. 5 identical reactions were performed and pooled after 2 hours at 25°C. The pooled mixture was
extracted with phenol/chloroform and DNA was recovered by ethanol precipitation. The DNA pellet was
dissolved in 100 ul water and purified by gel filtration on prepacked columns (G-50 Sephadex Nick
columns, Pharmacia). The DNA was eluted in 400 ul water. The solution was lyophilised and redissolved in
water (9 ul) to give a final concentration of 50 ng/ul. 1 ul of this solution was diluted to 50 ul with water (see
Figure 39(b)).



Construction of vectorette library 

Hind III Digestion

2 ul Pst I circles (100 ng)
0.5 ul 10X buffer
1.5 ul H2O
1 ul Hind III (2 units)

The reaction mixture was incubated at 37°C for 30 minutes.



Ligation

The following were added to the restriction digest

1 ul 10X buffer
1 ul 10 mM ATP
1 ul Annealed oligonucleotides 81 and 82 (0.25 pmole) or 81 and 83 (0.25 pmole)
ligase (2 units)
0.5 ul 10X Hind III buffer

The mixture was incubated for 30 minutes at 37°C.
Annealed oligonucleotides (81/82 or 81/83, 0.25 pmole) were added and the reaction was incubated for a
further hour at 37°C. The volume of the reaction mixture was then adjusted to 100 ul with water (see Figure
39(c)).

1. The following were added to a 0.5 ml Sarstedt tube:
Pst I circles (5 ng)
oligonucleotide 85 (100 pmole)
oligonucleotide 86 (100 pmole)
10 ul 10X PCR buffer
100 uM dNTPs
2 units Taq polymerase (Cetus)

The volume was adjusted to 100 ul with water and amplification was performed as described in
Example 8. The predicted 710 bp product was obtained, confirming that circles had been formed. The
results are shown in Figure 40.
Lane 1 φX174 Hae III DNA marker
Lane 2 710 bp product of amplification between oligonucleotides 85 and 86.

2. The following were placed in a 0.5 ml Sarstedt tube:
10 ng Pst I circle-Hind III vectorette-oligonucleotides 81/82
oligonucleotide 85 (100 pmole)
oligonucleotide 79 (100 pmole)
10 ul 10X PCR buffer
100 uM dNTPs
2 units Taq polymerase (Cetus)

The volume was adjusted to 100 ul with water and amplification was performed as described in
Example 8.

The predicted 720 bp product was obtained. The results are shown in Figure 40.
Lane 3 720 bp product of amplification between oligonucleotides 85 and 79.
50 3. The amplification reaction mixture described in (2) above was diluted by a factor of lO'. 1 ul of the 

diluted mixture was placed in a 0.5 ml Sarstedt tube.

The following were also added to the Sarstedt tube.
oligonucleotide 79 (100 pmole)
oligonucleotide 88 (100 pmole)
10 ul 10X PCR buffer
100 uM dNTPs
2 units Taq polymerase (Cetus)

The volume was adjusted to 100 ul and amplification was performed as described in Example 8. The
predicted 500 bp product was obtained. The results are shown in Figure 40.
Lane 4 500 bp product of amplification between oligonucleotides 79 and 88.

4. The amplification product from (3) above was digested with Pst I. The predicted products (340 bp,
160 bp) were obtained. The result is shown in Figure 40.

Lane 5 Products resulting from digestion of PCR product (Lane 4) with Pst I. Products of 340 bp and
160 bp are obtained.

5. The following were placed in a 0.5 ml Sarstedt tube.
10 ng Pst I circle-Hind III vectorette-oligonucleotides 81/83
oligonucleotide 85 (100 pmole)
oligonucleotide 79 (100 pmole)
10 ul 10X PCR buffer
100 uM dNTPs
2 units Taq polymerase (Cetus).

The volume was adjusted to 100 ul with sterile, double distilled water and amplification was performed
as described in Example 8. The predicted 720 bp product was obtained. The results are shown in Figure
40.

Lane 6 720 bp product of amplification between oligonucleotides 79 and 85.

6. The amplification reaction mixture described in (5) above was diluted by a factor of 10^[illegible]. 1 ul of the
diluted mixture was placed in a 0.5 ml Sarstedt tube. The following were also added to the Sarstedt tube:

oligonucleotide 79 (100 pmole)
oligonucleotide 88 (100 pmole)
10 ul 10X PCR buffer
100 uM dNTPs
2 units Taq polymerase (Cetus)

The volume was adjusted to 100 ul with sterile, double distilled water and amplification was performed as
described in Example 8. The predicted 500 bp product was obtained. The results are shown in Figure 40.
Lane 7 500 bp product of amplification between oligonucleotides 79 and 88.

7. The amplification product from (6) above was digested with PstI. The predicted products (340 bp,
160 bp) were obtained. The results are shown in Figure 40.

Lane 8 Products resulting from digestion of PCR product (Lane 6) with PstI. Products of 340 bp and
160 bp are obtained.

8. The amplification reaction mixture described in (3) above was diluted by a factor of 10^[illegible]. 1 ul of the
diluted mixture was placed in a 0.5 ml Sarstedt tube. The following were added to the Sarstedt tube:

oligonucleotide 79
oligonucleotide 85
10 ul 10X PCR buffer
100 uM dNTPs
2 units Taq polymerase (Cetus)

The volume was adjusted to 100 ul with sterile, double distilled water and amplification was performed as
described in Example 1. The predicted 720 bp product was obtained. The results are shown in Figure 40.
Lane 9 720 bp product of amplification between oligonucleotides 79 and 85.

9. The amplification product from (8) above was digested with PstI. The predicted products (370 bp,
350 bp) were obtained.

Lane 10 Products resulting from digestion of PCR product (Lane 9) with PstI.
Products of 370 bp and 350 bp were obtained.
Lane 11 φX174 Hae III DNA markers.
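
The PstI digests reported in steps 4, 7 and 9 can be sanity-checked by confirming that the fragment sizes add up to the size of the undigested product. The following is a minimal sketch of that check; it assumes a single PstI site in each linear product and treats the sizes read from the gel as exact, which real gel estimates are not. The function name and the labels in the list are illustrative only.

```python
def digest_is_consistent(product_bp, fragment_bps):
    """True if the digestion fragments account for the whole product."""
    return sum(fragment_bps) == product_bp


# (description, undigested product size in bp, reported fragment sizes in bp)
reported_digests = [
    ("step 4, Lane 5", 500, [340, 160]),
    ("step 7, Lane 8", 500, [340, 160]),
    ("step 9, Lane 10", 720, [370, 350]),
]

if __name__ == "__main__":
    for label, product_bp, fragments in reported_digests:
        status = "consistent" if digest_is_consistent(product_bp, fragments) else "inconsistent"
        print(f"{label}: {fragments} vs {product_bp} bp -> {status}")
```

All three reported digests pass this check: 340 + 160 = 500 and 370 + 350 = 720.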
Sample from lane 3 of the agarose gel shown in Figure 40 was precipitated and loaded onto a fresh
agarose gel, the photograph of which appears as Figure 41. The lanes of the photograph in Figure 41 are
as follows:-

Lane 1 φX174 Hae III DNA marker
Lane 2 Sample from lane 3 of Figure 40
Lane 3 φX174 Hae III DNA marker
Lane 4 Amplification from lane 9 of Figure 40.


Example 25 

Examples 3, 4, 5, 6 and 7 were repeated except that no dephosphorylation step was effected. The
results are set out in Figure 42 in which the lanes of the photograph of the agarose gel are as follows:-

Lane 1 φX174 Hae III DNA marker
Lane 2 the amplification product of Example 3(i)
Lane 3 the amplification product of Example 3(ii)
Lane 4 the amplification product of Example 4
Lane 5 the amplification product of Example 5
Lane 6 the amplification product of Example 6(ii)
Lane 7 the amplification product of Example 7(i)
Lane 8 this lane was omitted
Lane 9 φX174 Hae III marker.



Claims 

1. A method for the amplification of a nucleic acid fragment comprising unknown sequence, by primer
extension which method comprises cleaving a target nucleic acid to obtain target nucleic acid fragments,
one of said fragments containing an initiating priming region of known nucleotide sequence for hybridisation
with an initiating primer, preparing target nucleic acid fragment/vectorette units from the target nucleic acid
fragments by ligation each unit having a vectorette priming region of known sequence for hybridisation with
a vectorette primer, and treating the target nucleic acid fragment/vectorette units, together or sequentially,
with appropriate nucleoside triphosphates and an agent for polymerisation of nucleoside triphosphates
under hybridising conditions, such that an extension product of an initiating primer is synthesised
complementary to a single stranded target nucleic acid/vectorette unit having an initiating priming region to
which is hybridised an initiating primer selected so as to be substantially complementary to the initiating
priming region, whereas no such extension product is synthesised complementary to single stranded target
nucleic acid fragment/vectorette units having no such initiating priming region.

2. A method as claimed in claim 1 wherein the extension product is subjected to amplification in the
presence of a vectorette primer which primer is selected so as to be substantially complementary to a
vectorette priming region.

3. A method as claimed in claim 2 wherein the synthesis of vectorette primer amplification products is
dependent upon the initial synthesis of an extension product of the initiating primer.

4. A method as claimed in claim 3 wherein the vectorette portion of the target nucleic acid
fragment/vectorette unit comprises a double stranded portion having first and second strands, the second
strand of the vectorette portion being ligated to that strand of the target nucleic acid fragment which
contains the initiating priming region, and the nucleotide sequence of the first strand, second strand and
vectorette primer being selected such that the vectorette primer is capable of hybridising to the
complement of the second strand but not to the first strand under the same hybridisation conditions.

5. A method as claimed in claim 3 wherein the vectorette portion of the target nucleic acid
fragment/vectorette unit comprises a double stranded portion having first and second strands, the first
strand having a terminal polymerisation blocking moiety and the second strand, which in use is ligated to
that strand of the target nucleic acid fragment containing the initiating priming region, carrying a single
stranded portion, the terminal polymerisation blocking moiety being effective to prevent extension of the
first strand to form a complement to the said single stranded portion of the second strand in the presence
of appropriate nucleoside triphosphates and an agent for the polymerisation of the nucleoside triphosphates
under hybridising conditions.

6. A method as claimed in any one of the preceding claims wherein a plurality of different vectorette
libraries is prepared for use with the same single initiating primer, each vectorette library comprising target
nucleic acid fragment/vectorette units obtained by ligation of nucleic acid fragments prepared by cleaving
target nucleic acid at different cleavage sites; treating each vectorette library either separately or together
with appropriate nucleoside triphosphates and an agent for polymerisation of nucleoside triphosphates
under hybridising conditions whereby to obtain a plurality of initiating primer extension products, based on
the use of the same single initiating primer.

7. A method as claimed in claim 1 or claim 2 wherein a nucleic acid having termini capable of ligation
to each other is circularised, the nucleic acid containing a portion thereof being capable of serving as an
initiating priming region for hybridisation with an initiating primer; cleaving the circularised nucleic acid
outside the known nucleotide sequence to form a linear molecule containing the known nucleotide sequence
and having a known cleavage site pattern at at least one terminus for ligation to form a target nucleic acid
fragment/vectorette unit; forming said target nucleic acid fragment/vectorette unit by ligation and treating the
target nucleic acid fragment/vectorette unit, together or sequentially, with initiating primer, appropriate
nucleoside triphosphates and an agent for polymerisation of the nucleoside triphosphates under hybridising
conditions.

8. A method as claimed in claim 7 wherein the target nucleic acid fragment/vectorette unit is treated,
together or sequentially with initiating primer, vectorette primer, appropriate nucleoside triphosphates and
an agent for polymerisation of the nucleoside triphosphates.

9. A method as claimed in any one of the preceding claims wherein the target nucleic acid
fragment/vectorette units are prepared from target nucleic acid fragments by ligation such that the
vectorette cannot be cleaved from the target nucleic acid fragment/vectorette units obtained therefrom by
the same agent used to cleave the target nucleic acid to yield target nucleic acid fragments.

10. A method as claimed in claim 9 wherein the target nucleic acid is cleaved with a restriction
endonuclease to yield a target nucleic acid fragment for ligation to a vectorette, the sequence of the
vectorette being selected such that the restriction endonuclease recognition sequence of the said restriction
endonuclease is absent in the target nucleic acid fragment/vectorette unit.

11. A method as claimed in any one of the preceding claims wherein any or all of the initiating primer
extension products obtained is (are) sequenced at least at the end(s) distal to a given initiating primer so as
to determine the sequence of a further initiating primer whereby to obtain further initiating primer extension
products based on primer extension of the further initiating primer.

12. A method as claimed in any one of the preceding claims wherein an initiating primer extension
product or portion thereof is sequenced whereby to characterise the said extension product or portion
thereof.

13. A method as claimed in claim 11 or claim 12 for the identification of a genotype responsible for a
given phenotype wherein the sequence of nucleic acid containing the genotype is compared with the
sequence of nucleic acid containing no such genotype whereby to identify the genotype responsible for the
given phenotype.

14. A method as claimed in claim 13 wherein the comparison is effected by the use of a single sample
of nucleic acid from an obligate heterozygote for the given phenotype.

15. A method as claimed in claim 11 or claim 12 for the identification of a genotype responsible for or
contributory to a predisposition to a given phenotype if such be present wherein the sequences of nucleic
acid from a plurality of individuals affected by the phenotype to be investigated are compared with the
sequences of nucleic acid from a plurality of individuals in which no evidence of the phenotype to be
investigated is present whereby to identify a genotype responsible for or contributory to a predisposition to
the phenotype to be investigated if such be present.

16. A method as claimed in claim 15 wherein the sequence differences between 1) a pool of nucleic
acids from a plurality of individuals affected by the phenotype to be investigated; and 2) a pool of nucleic
acids from a plurality of individuals presenting no evidence of the phenotype to be investigated; are
compared.

17. A kit for the amplification of a nucleic acid fragment of unknown sequence by primer extension,
which kit comprises:-

1) means for cleaving a target nucleic acid at a specific site to obtain a target nucleic acid fragment;

2) a vectorette adapted for ligation to a target nucleic acid fragment obtained by use of the means for
cleaving a target nucleic acid defined in (1) whereby to form in use a target nucleic acid fragment/vectorette
unit said vectorette having a vectorette priming region of known sequence for hybridisation with a vectorette
primer;

3) each of four different nucleoside triphosphates; and

4) an agent for polymerisation of the nucleoside triphosphates in (3).

18. A kit as claimed in claim 17 which additionally comprises a vectorette primer and if desired at least
one nested primer, said primer(s) having a nucleotide sequence substantially complementary to a vectorette
priming region of a target nucleic acid fragment/vectorette unit.

19. A kit as claimed in claim 17 or claim 18 wherein the vectorette adapted for ligation to a target
nucleic acid fragment comprises a double stranded portion having first and second strands, the second
strand of the vectorette being adapted for ligation to that strand of the nucleic acid fragment which contains
the initiating priming region, and the nucleotide sequence of the first strand and second strand being
selected such that in use a vectorette primer is capable of hybridising to the complement of the second
strand but not to the first strand under the same hybridisation conditions.

20. A vectorette library kit for the amplification of a nucleic acid fragment of unknown sequence by
primer extension, which kit comprises:-

1) at least one vectorette library each library comprising a set of target nucleic acid
fragment/vectorette units obtained from nucleotide sequences of an individual member of a species of
animal, plant or microorganism; and

2) an initiating primer or primers for hybridisation to an initiating priming region of the target nucleic
acid fragment/vectorette units.
21. A kit as claimed in claim 20 for the analysis of a genotype responsible for or contributory to a
predisposition to a given phenotype wherein the set of target nucleic acid fragment/vectorette units
comprises:-

1) a pool of target nucleic acid fragment/vectorette units from a plurality of individuals affected by the
phenotype to be investigated; and
2) a pool of target nucleic acid fragment/vectorette units from a plurality of individuals presenting no
evidence of the phenotype to be investigated.
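
The selectivity recited in claim 1 (an extension product arises only from a fragment/vectorette unit that carries the initiating priming region) can be illustrated with a toy model. The sketch below is a deliberately simplified, single-stranded illustration and not the claimed method itself: it ignores double-stranded structure, cohesive ends and the special vectorette designs of claims 4 and 5, and it reports the copied template span rather than its complement. The target, vectorette and priming-region sequences and all function names are invented for illustration; only the Hind III recognition sequence (AAGCTT, cut after the first A) is a real property.

```python
# Toy single-stranded model of selective primer extension on vectorette units.
HINDIII_SITE = "AAGCTT"          # Hind III recognition sequence
HINDIII_CUT_OFFSET = 1           # Hind III cuts after the first A (A^AGCTT)
VECTORETTE = "CTCTCCCTTC"        # hypothetical vectorette sequence
INITIATING_REGION = "GATTACA"    # hypothetical region of known sequence
TARGET = "CCGTAAGCTTGGATTACAGTCCGTTAAGCTTACGT"  # hypothetical target


def cleave(target, site, cut_offset):
    """Cut a linear sequence at every occurrence of the recognition site."""
    fragments, start = [], 0
    pos = target.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(target[start:cut])
        start = cut
        pos = target.find(site, pos + 1)
    fragments.append(target[start:])
    return fragments


def ligate_vectorette(fragments, vectorette):
    """Attach the same vectorette to the end of every fragment."""
    return [fragment + vectorette for fragment in fragments]


def extension_products(units, initiating_region):
    """Keep only units with the initiating priming region, from that region onward."""
    products = []
    for unit in units:
        start = unit.find(initiating_region)
        if start != -1:
            products.append(unit[start:])
    return products


fragments = cleave(TARGET, HINDIII_SITE, HINDIII_CUT_OFFSET)
units = ligate_vectorette(fragments, VECTORETTE)
products = extension_products(units, INITIATING_REGION)

if __name__ == "__main__":
    print("fragments:", fragments)
    print("products: ", products)
```

In this toy run three fragment/vectorette units are formed, but only the one containing the assumed known region yields a product, and that product runs from the known region through the previously unknown flanking sequence into the vectorette.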



Fig. 5. 



i) Blocking vectorette

5* N^N^N^N^N^ AAGGAGAGGX 3* 
llMMilll 

3' n5' TTCCTCTCCTGTCCCTAAGAGCATCCrrcCCAATGCTAAGC 5* 



ii) Non-complementary vectorette

GCTGTCTGTCGAAGGTAAGGAACGGACGA 
5< N^N^N^N^N^ AAGGAGAGGAC I I III I 1 I II II GAGAAGGGAGAG 3* 
n I II I 1 1 n 1 1 I I 1 1 I I I t M II I i t M I I M M I 
31 jj5» TTCCTCTCCTG I I III I I I M II CTCTTCCCTCTC 5* 
TCGCTAAGAGCATGCTTGCCAATGCTAAG 
Fig. 28.

1 ATGCCTGCAG GTCGACTCTA GAGGATTCCC CTAGAGCATA TAAAATTATT 

51 TTCAAGGGAA GATCTAAAAA TACGTATGAA GAAGTTCTGG TACTTTTTTC 

101 cccAcccAcc' AGATCAc^ rr rmTm mrrrmT tttttttttt 

151 TATCACTTGA -GTCmTGCA CTGCTCTTTA AAGATTCCTG CTCTCAGGAG 

201 CACCTAAGCG TAGAATAGCA ACTGGCACTT CaiGGCCAG GACATGCCTT 

251 CTGGAGGTGC TTATGGAGAT GCACTGGAGT CTCGGTCTTC GAAAGTAGGC 

301 ATGTTTGATG GGCCAGTTTT ACATTAAGTT CTGTAGTCTC ATTGCATCTG

351 -GCACCCTCAA ATTTCTTATT AAATACTACA . GCCTCGT^ TAGCTTTGTT 

401 CCATAGTAGG A AAAGTTTGA GCATAGGAAA > AGTTCTGTGC Cq AAAATTRr

451 ATCg^^AAT TGGGTACCGA GCTC;.,,! ' 72



. .-^ i;: - AGA7GCTCTA CACATAAAAG GACAAGCTGA ACTCCTGCaa CtCaCTi'AaC 

cV HinfI HinfI

GAGAACTATT AGAATCGGCA CATTTGAACA GACAAGTCGC TGAATCCAAG 

101 ATGAAGCCTT TTATTTTTAA GAAAAAAATG GCCTTTACAT CACCGACAAA 

151 TTAGGTTCAG ATGAAAAAGG CTGTTTC 